Transcriptional Intermediary Factor-2



Field of the Invention 

The present invention relates to a nuclear receptor (NR) transcriptional
mediator. More specifically, isolated nucleic acid molecules are provided
encoding transcriptional intermediary factor-2 (TIF2). Recombinant methods for
making TIF2 polypeptides are also provided, as are TIF2 antibodies and screening
methods for identifying agonists and antagonists of the activation function AF-2
of nuclear receptors. Also provided are screening methods for identifying
agonists and antagonists of TIF2 AD1 activation domain activity and of TIF2 AD2
activation domain activity.

Background of the Invention 

Activators that enhance the initiation of transcription by RNA polymerase
B (II) are composed of at least two functional domains: a DNA binding domain
and an activating domain (M. Ptashne, Nature 335:683-689 (1988); P.J. Mitchell
et al., Science 245:371-378 (1989)). These two domains are generally separable
functional units, and each can be interchanged with the complementary
region of an unrelated activator, thereby creating functional chimeric activators (S.
Green et al., Nature 325:75-78 (1987)).

A number of structure-function analyses of eukaryotic transcriptional
activators have been performed, focussing primarily on the yeast GAL4 and
GCN4 proteins and on members of the nuclear receptor family. GAL4 and GCN4
proteins activate transcription by binding to specific upstream activation sequences,
which have many of the characteristics of higher eukaryotic enhancer elements (K.
Struhl, Cell 49:295-297 (1987)). The herpes simplex activator VP16 represents
another type of activator, which activates transcription by binding to the
DNA-bound octamer transcription factor rather than binding to the DNA directly (T.
Gerster et al., Proc. Natl. Acad. Sci. USA 85:6347-6351 (1988)).

The nuclear receptor family, which includes receptors for steroid
hormones, thyroid hormones, vitamin D, and the vitamin A derivative retinoic
acid, also comprises transcriptional enhancer factors, which bind DNA directly in the
presence of their cognate ligand by recognition of specific enhancer elements, i.e.,
hormone- or ligand-responsive elements (R.M. Evans, Science 240:889-895 (1988)).
These cognate ligands tend to be small, hydrophobic molecules, including steroid
hormones such as estrogen and progesterone, thyroid hormone, vitamin D, and
various retinoids (S. Halachmi et al., Science 264:1455-1458 (1994);
Gronemeyer, H. and Laudet, V., Protein Profile 2:1173-1308 (1995)).

Despite their small size and apparently simple structure, however, the 
cognate ligands associated with NRs are known to elicit a wide range of 
physiological responses. Adrenal steroids such as cortisol and
aldosterone, for example, widely influence body homeostasis, controlling glycogen and mineral
metabolism, have widespread effects on the immune and nervous systems, and 
influence the growth and differentiation of cultured cells. The sex hormones 
(progesterone, estrogen and testosterone) provoke the development and 
determination of the embryonic reproductive system, masculinize/feminize the 
brain at birth, control reproduction and related behavior in adults and are 
responsible for development of secondary sex characteristics. Vitamin D is 
necessary for proper bone development and plays a critical role in calcium 
metabolism and bone differentiation. Significantly, aberrant production of these 
hormones has been associated with a broad spectrum of clinical disease, including 
cancer and similar pathologic conditions. 

All NRs display a modular structure, with five to six distinct regions,
termed A-F. The N-terminal A/B region contains the activation function AF-1,
which can activate transcription constitutively. Region C encompasses the DNA
binding domain (DBD), which recognizes cognate cis-acting elements. Region E
contains the ligand-binding domain (LBD), a dimerization surface and the
ligand-dependent transcriptional activation function AF-2 (reviewed in Mangelsdorf,
D.J. et al., Cell 83:835-839 (1995a); Mangelsdorf & Evans, Cell 83:841-850
(1995b); Beato, M. et al., Cell 83:851-857 (1995); Gronemeyer & Laudet,
"Transcription Factors 3: Nuclear Receptors", in Protein Profile, vol. 2, Academic
Press (1995); Kastner, P. et al., EMBO J. 11:629-642 (1992); Chambon, P.,
FASEB J. 10:940-954 (1996)).

Several classes of domains in activators are capable of mediating
transcriptional activation. Yeast activators GAL4 and GCN4 and herpes simplex
VP16 all contain activation domains that are composed of acidic stretches of
amino acids, which may act by forming amphipathic α-helices (I.A. Hope et al.,
Cell 46:885-894 (1986); J. Ma et al., Cell 48:847-853 (1987); E. Giniger et al.,
Nature 330:670-672 (1987); S.J. Triezenberg et al., Genes Dev. 2:718-729
(1988)). The activation functions of human Sp1 and CTF/NFI proteins contain
glutamine- and proline-rich areas, respectively (A.J. Courey et al., Cell 55:887-898
(1988); N. Mermod et al., Cell 58:741-753 (1989)). Studies with steroid
hormone receptors have shown that both the N-terminal A/B domain and the
C-terminal hormone binding domain (HBD) contain transcription activation
functions (AFs) (M.T. Bocquel et al., Nucl. Acids Res. 17:2581-2595 (1989); L.
Tora et al., Cell 59:477-487 (1989)). The AFs of the human estrogen receptor
(hER) do not contain stretches of acidic amino acids (S. Halachmi et al., Science
264:1455-1458 (1994)). In contrast, the human glucocorticoid receptor
(hGR) contains two activation functions, τ-1 (located in the A/B domain) and τ-2
(located in the N-terminal region of the HBD), both of which are acidic (S.M.
Hollenberg et al., Cell 55:899-906 (1988)).

From the results of studies on transcriptional interference/squelching
between nuclear receptors and on homo- and heterosynergistic stimulation of
initiation of transcription from minimal promoters by the activation functions
present in hER (AF-1 and AF-2) and the acidic activator VP16, it has been
proposed that AFs may activate transcription by interacting with different
components of the basic initiation complex (Bocquel et al., Nucl. Acids Res.
17:2581-2595 (1989); Meyer et al., Cell 57:433-442 (1989); L. Tora et al., Cell
59:477-487 (1989)). Studies of the transcriptional interference/squelching
properties of acidic activating domains (AADs), hER AF-1 and hER AF-2, however,
showed that both hER AF-1 and AF-2 can squelch acidic activators, such as VP16,
but that the converse was not true, i.e., AADs do not squelch hER AF-1 or AF-2.
Moreover, hER AF-1 and AF-2, which are clearly distinguished by their synergistic
properties, nevertheless squelch each other (D. Tasset et al., Cell 62:1177-1187 (1990)).

Based on these results, it was proposed that a string of transcriptional 
intermediary factors (TIFs) exists, interposed between enhancer factors and the 
basic transcriptional factors. For example, AF-1 and AF-2 have been suggested 
to contact the string of TIFs at functionally equivalent points, while AADs are 
believed to interact at an earlier point in the series (D. Tasset et al., Cell
62:1177-1187 (1990)).

Several putative coactivator TIFs for NR AF-2s have been characterized
(see Chambon, P., FASEB J. 10:940-954 (1996); Glass, C.K. et al., Current Opin.
Cell Biol. 9:222-232 (1997); Horwitz, K.B. et al., Mol. Endocrinol. 10:1167-1177
(1996) for recent reviews). In particular, LeDouarin, B. et al., EMBO J.
15:6701-6715 (1996) have demonstrated that a 10-amino acid fragment of TIF1α
is necessary and sufficient to mediate interaction with RXR in a ligand- and AF-2
integrity-dependent manner. Notably, within this TIF1α fragment, they identified
a LxxLLL (SEQ ID NO: 13) motif, termed NR box, whose integrity is required for
interaction with nuclear receptors, and pointed out that this motif is conserved in
several other putative coactivators (LeDouarin, B. et al., EMBO J. 15:6701-6715
(1996)). Whereas TIF1α and several other putative coactivators do not, or only
very poorly, stimulate transactivation by NRs in transiently transfected mammalian
cells, the TIF2/SRC-1 family (Onate, S.A. et al., Science 270:1354-1357 (1995);
Voegel, J.J. et al., EMBO J. 15:3667-3675 (1996)), the CBP/p300 family (Kamei,
Y. et al., Cell 85:403-414 (1996); Chakravarti, D. et al., Nature 383:99-103 (1996);
Hanstein, B. et al., Proc. Natl. Acad. Sci. USA 93:11540-11545 (1996); Smith,
C.L. et al., Proc. Natl. Acad. Sci. USA 93:8884-8888 (1996); for recent reviews
see Eckner, R., Biol. Chem. 377:685-688 (1996); Janknecht & Hunter, Current
Biol. 6:951-954 (1996b); Shikama, N. et al., Trends in Cell Biol. 7:230-236
(1997)) and the androgen receptor coactivator ARA70 (Yeh & Chang, Proc. Natl.
Acad. Sci. USA 93:5517-5521 (1996)) have been unequivocally shown to enhance
AF-2 activity.

In addition to binding NRs, CBP/p300 can also interact directly with SRC-1
(Kamei, Y. et al., Cell 85:403-414 (1996); Yao, T.P. et al., Proc. Natl. Acad.
Sci. USA 93:10626-10631 (1996)), and both factors have been shown to exert
histone acetyltransferase activity (Bannister & Kouzarides, Nature 384:641-643
(1996); Ogryzko, V.V. et al., Cell 87:953-959 (1996)). Moreover, CBP/p300 can
recruit p/CAF, which is itself a nuclear histone acetyltransferase (Yang, X.J. et al.,
Nature 382:319-324 (1996)). However, apart from interacting with coactivators
in a ligand-dependent manner, NRs have also been shown to interact, often in a
ligand-independent fashion, directly or indirectly with components of the
transcriptional machinery, such as TFIIB, TBP, TAFs, or TFIIH (Baniahmad et
al. (1993); Jacq, X. et al., Cell 79:107-117 (1994); Schulman, I.G. et al., Mol.
Cell. Biol. 16:3807-3813 (1996); May, M. et al., EMBO J. 15:3093-3104 (1996);
Mengus, G. et al., Genes & Dev. 11:1381-1395 (1997)).

Hong, H. et al., Proc. Natl. Acad. Sci. USA 93:4948-4952 (1996)
originally described a partial cDNA of the mouse homologue of TIF2, named
GRIP1, and recently reported the isolation of a full-length GRIP1 cDNA (Hong,
H. et al., Mol. Cell. Biol. 17:2735-2744 (1997)). Using the yeast Saccharomyces
cerevisiae as a model system, they have shown that transcriptional activation by
TR, RAR and RXR could also be stimulated by GRIP1 coexpression, which
suggests that TIF2/GRIP1 could be a general coactivator for NRs (Hong, H. et
al., Mol. Cell. Biol. 17:2735-2744 (1997)).

The overall picture emerging from multiple recent studies on the
mechanisms by which nuclear receptors modulate target gene transcription
involves three successive steps: (i) the ligand-induced transconformation of the
NR LBD, which results in (ii) the dissociation of corepressors and formation of
TIFs/coactivator complexes, which themselves (iii) through interaction with
additional downstream factors (e.g., CBP, p300) modulate the acetylation status
of core histones and, thus, chromatin condensation/decondensation. Histone
acetylation on its own is, however, insufficient for transcription activation (Wong
et al. (1997)), and a simultaneous or subsequent fourth event comprises the direct
and/or indirect recruitment of elements of the transcription machinery (e.g., TFIIB,
TBP, TAFs, TFIIH; Jacq, X. et al., Cell 79:107-117 (1994); Schulman, I.G. et al.,
Mol. Cell. Biol. 16:3807-3813 (1996); May, M. et al., EMBO J. 15:3093-3104
(1996); Mengus, G. et al., Genes & Dev. 11:1381-1395 (1997)). Note that such
interactions do not need to be ligand-dependent if the primary function of the
liganded LBD (AF-2) is to regulate DNA accessibility through chromatin 
remodeling. Indeed, several of the reported interactions between NRs and general 
transcription factors occur in a ligand-independent manner. Accordingly, there is 
a need in the art for the isolation and characterization of transcriptional 
intermediary factors. 



Summary of the Invention 

By screening 340,000 clones of a human placenta cDNA expression library 
with an estradiol-bound estrogen receptor probe, the present inventors have 
identified a cDNA clone containing the gene encoding transcriptional intermediary 
factor 2 (TIF2). By the invention, TIF2 has been shown to exhibit all the 
properties expected for a TIF/mediator of AF-2: it interacts directly with the 
LBDs of several NRs in an agonist- and AF-2-integrity-dependent manner in vitro 
and in vivo, harbours an autonomous AF, relieves NR autosquelching, and 
enhances the activity of NR AF-2s when overexpressed in mammalian cells. 

Thus, in one aspect, the present invention provides isolated nucleic acid
molecules comprising a polynucleotide encoding TIF2 whose amino acid sequence
is shown in Figure 1 (SEQ ID NO:2) or a fragment thereof. In another aspect, the
invention provides isolated nucleic acid molecules encoding TIF2 having an amino
acid sequence as encoded by the cDNA deposited as ATCC Deposit No. 97612.

The invention further provides an isolated nucleic acid molecule that 
hybridizes under stringent conditions to the above-described nucleic acid 
molecules. The present invention also relates to variants of the nucleic acid 
molecules of the present invention, which encode fragments, analogs or 
derivatives of the TIF2 protein, e.g., polypeptides having at least one biological 
activity that is substantially similar to at least one biological activity of the TIF2
protein. 

The present invention is further directed to isolated nucleic acid molecules 
that encode a cytoplasmic TIF2 polypeptide. Methods for generating nucleic acid 
molecules that encode a cytoplasmic TIF2 polypeptide include mutating or 
deleting the NLS-coding N-terminal region of the nucleotide sequence shown in
Figure 1 (SEQ ID NO:1). Preferably, nucleic acid molecules encoding a
cytoplasmic TIF2 polypeptide will be fragments having a deletion in all or part of
the N-terminal NLS-coding region. By the invention, the cytoplasmic TIF2
polypeptides described herein display at least one biological activity that is 
substantially similar to at least one biological activity of TIF2. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 90% 
identical, and more preferably at least 95%, 96%, 97%, 98%, or 99% identical to 
the above described nucleic acid molecules. 

The present invention also relates to vectors which contain the above- 
described isolated nucleic acid molecules, host cells transformed with the vectors 
and the production of TIF2 polypeptides by recombinant methods. 

The present invention further provides isolated TIF2 polypeptides having 
the amino acid sequence shown in Figure 1 (SEQ ID NO:2). In a further aspect, 
isolated TIF2 polypeptides are provided having an amino acid sequence as encoded
by the cDNA deposited as ATCC Deposit No. 97612. 



WO 98/02455 



PCT/US97/12100 




Screening methods are also provided for identifying agonists and 
antagonists of nuclear receptor AF-2 function, for identifying agonists and 
antagonists of TIF2 AD1 activity, and for identifying agonists and antagonists of 
TIF2 AD2 activity. Also provided are TIF2 antibodies. 

Brief Description of the Figures 

Figure 1(a-b). The nucleotide (SEQ ID NO:1) and amino acid (SEQ ID
NO:2) sequences of the transcriptional intermediary factor-2 (TIF2) protein. This 
protein has a deduced molecular weight of about 160 kDa. The amino acid 
sequence of the functional coactivator TIF2.1 protein fragment is shown from 
amino acid residue 624 to residue 1287. 

Figure 2(a-c). TIF2 is the 160-kDa nuclear-receptor-interacting factor. 

(a) GST pull-down experiments identify a 160-kDa protein interacting
with liganded estrogen receptor (ER) and retinoic acid receptor (RAR)-α ligand
binding domains (LBDs) (ER(DEF) and RARα(DEF), respectively). Note that
less material was run in lane 5 than in lanes 1-4.

(b) Immunodepletion followed by Far-Western detection demonstrates
identity of TIF2 with the biochemically identified 160-kDa protein. Open triangle,
TIF2; arrowhead, TIF1; circle, antibody crossreaction to GST-ER(DEF). The
pα-TIF2-immunodetected species smaller than TIF2 (lanes 2 and 6) most probably is
a degradation product of TIF2, as it is removed by immunodepletion with
mα-TIF2 (lanes 4 and 8).

(c) Northern blotting reveals a ≈9-kb TIF2 transcript in various human
tissues.

Methods. (a) In vivo 35S-Met-labeled MCF7 whole cell extracts
(Cavailles, V. et al., Proc. Natl. Acad. Sci. USA 91:10009-10013 (1994)), twice
precleared with GST-loaded glutathione sepharose, were incubated (Le Douarin,
B. et al., EMBO J. 14:2020-2033 (1995)) with GST, GST-hER(DEF) or
GST-hRARα(DEF), in presence or absence of 10" M E2 or T-RA. Bound proteins
were recovered with SDS sample buffer and revealed by fluorography (Amplify,
Amersham) of SDS-polyacrylamide gels.

(b) HeLa whole cell extracts (2 ml in 500 mM NaCl, 250 mM Tris-HCl
pH 7.5, 20% glycerol, 5 mM DTT) were precleared with protein-G sepharose
(400 µl) and treated with protein-G sepharose (3 x 400 µl) loaded with mα-TIF2
(raised against a synthetic peptide encompassing amino acids E624-Q643 coupled
to ovalbumin) or non-specific mouse IgG serum. After further clearing with
protein-G sepharose (400 µl), the supernatant was incubated (Le Douarin, B. et
al., EMBO J. 14:2020-2033 (1995)) with GST-hER(DEF) in presence or absence
of E2 (10⁻⁶ M). Retained proteins were recovered with SDS sample buffer,
separated by SDS-PAGE and electroblotted on nitrocellulose membranes.
Far-Western blotting was as described (Cavailles, V. et al., Proc. Natl. Acad. Sci. USA
91:10009-10013 (1994)). For immunoblotting, rabbit polyclonal antiserum
(pα-TIF2), raised against purified (Chen, Z-P. et al., J. Biol. Chem. 269:25110-25116
(1994)) recombinant E. coli-expressed His-tagged TIF2.1, was used. pα-TIF2
and rabbit polyclonal pα-TIF1 were diluted 1:2000 for ECL-based Western
blotting (Amersham). All constructs used in this study were verified by DNA
sequencing.

(c) Human Northern blot (Clontech, No 7760-1; Lot 5x332) was
revealed with 32P-labeled TIF2. The membrane was then rehybridized with
32P-labeled β-actin cDNA (Clontech).

Figure 3(a-b). Amino acid sequence of TIF2: homology with SRC-1 
indicates the existence of a novel family of NR mediators. 

(a) Alignment and amino acid sequences of TIF2 (SEQ ID NO:2) and the
steroid receptor coactivator SRC-1 (SEQ ID NO:3) (Onate, S.A. et al., Science
270:1354-1357 (1995)). Two charged clusters rich in acidic and basic amino acid
residues, three serine/threonine (S/T)-rich regions and one glutamine-rich region
are highlighted. The N-terminal charged cluster contains putative bipartite nuclear
localization signals (NLSs) (overlined). The regions encoding TIF2.1 (amino
acids 624 to 1287; functional coactivator fragment) and dnSRC-1 (amino acids
865 to 1061; dominant negative fragment) are indicated. An asterisk identifies the
TIF2 stop codon. Note that TIF2.1 and dnSRC-1 do not overlap, indicating that
dnSRC-1 may contain an NR-interacting region distinct from that of
TIF2.1.

(b) Schematic comparison of TIF2 and SRC-1. Percent identities
(similarities in parentheses) of homologous regions are indicated. The N-terminal 
charged cluster harbouring the putative NLS and the C-terminal S/T-rich region 
of TIF2 are not, or only weakly, conserved in SRC-1. 

Methods. 340,000 clones of a human placenta cDNA λEXlox
expression library were screened with a 32P-labeled GST-hER(DEF) probe in
presence of lO^M E2 using the Far-Western technique (Cavailles, V. et al., Proc.
Natl. Acad. Sci. USA 91:10009-10013 (1994)). The 1992-bp insert
corresponding to the initial clone (TIF2.1) was used to rescreen the same library.
Five highly overlapping cDNA inserts covered a region of 6 kb containing a
1,464-amino-acid ORF. All inserts were sequenced on both strands. Transient
expression of the assembled cDNA inserts encompassing the predicted ORF
yielded a 160-kDa protein.
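
The sizes quoted for TIF2.1 and for the full-length ORF can be cross-checked with simple arithmetic. The following Python sketch is illustrative only and is not part of the original disclosure; the function names are hypothetical, and the average residue mass of about 110 Da is an assumed rule of thumb rather than a value taken from the text. Under those assumptions, residues 624 to 1287 span 664 amino acids, which agrees with the 1992-bp insert, and a 1,464-residue ORF gives roughly 161 kDa, in line with the observed 160-kDa protein.

```python
# Assumption (not from the source): mean residue mass of ~110 Da, a common
# rule of thumb for estimating protein mass from sequence length.
AVERAGE_RESIDUE_MASS_DA = 110


def fragment_length(first_residue: int, last_residue: int) -> int:
    """Number of residues in an inclusive residue range."""
    return last_residue - first_residue + 1


def coding_length_bp(n_residues: int) -> int:
    """Nucleotides needed to encode n_residues (three per codon, no stop codon)."""
    return 3 * n_residues


def approx_mass_kda(n_residues: int) -> float:
    """Approximate protein mass in kDa from its length in residues."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000


# TIF2.1 is described as amino acids 624 to 1287 of TIF2.
tif2_1_residues = fragment_length(624, 1287)
tif2_1_bp = coding_length_bp(tif2_1_residues)

# The full-length ORF is described as 1,464 amino acids.
tif2_mass_kda = approx_mass_kda(1464)

print(tif2_1_residues, tif2_1_bp, f"{tif2_mass_kda:.0f} kDa")
```

The estimate ignores post-translational modification and anomalous gel migration, so it should be read only as a plausibility check on the stated figures.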

Figure 4(a-n). In vivo and in vitro interactions of TIF2 with nuclear 
receptors. 

(a) Overexpressed TIF2 protein is mostly localized in discrete nuclear 
bodies and excluded from nucleoli. A superposed image of Hoechst DNA staining 
and TIF2 immunostaining is shown. 

(b-i) Cytoplasmic TIF2.1 interacts in an agonist-dependent manner with
nuclear receptors in mammalian cells. Light staining indicates TIF2.1-NR
colocalization.

(k-n) TIF2.1 directly interacts in vitro in an agonist-dependent manner
with nuclear receptors, and point mutations within the AF-2 activation domain
(AD) core abolish this interaction. WT, wild-type. Ligand concentrations for n:
9C-RA, T-RA and T3, 10" M; E2, 5xl0°M; OHT, SxlO^M. The smaller
immunodetected polypeptide is a degradation product of TIF2.1. Note that the
anti-TIF2 serum weakly crossreacts with GST-hER(DEF).

Methods. (a-i) Cos-1 cells were transiently transfected with TIF2.1 (10
µg) either (a) alone or (b-i) together with the indicated NR expression vectors
(10 µg, except RARα, 1 µg) in absence or presence of the cognate ligand (lO'lVl
except R5020, 10«M). In d-f, HE0 (Webster, N.J. et al., Cell 54:199-207 (1988))
was used. Immunocytofluorescence assays were as described (Kastner, P. et al.,
EMBO J. 11:629-642 (1992)). Images were recorded by confocal laser
microscopy.

(k-n) GST interaction assays with E. coli-expressed recombinant TIF2.1
(Fig. 2b) were performed as described (Le Douarin, B. et al., EMBO J.
14:2020-2033 (1995)). Bound proteins were revealed by Western blotting with pα-TIF2
antiserum (dilution 1:30,000); equal loading of affinity matrices was verified by
SDS-PAGE and Coomassie staining. 'Input' lanes contain one third of TIF2.1
input.

Figure 5(a-e). TIF2 contains an autonomous AF, "antisquelches", and
stimulates NR AF-2 activity in an agonist-, promoter- and cell-dependent manner.

(a) Increasing amounts of GAL-TIF2.1 fusion protein (lanes 2-4) activate
transcription of a cognate reporter in transfected cells. Fold-induction is given
below the CAT assays.

(b) TIF2.1 partially reverses transcriptional autointerference of ER.
Normalized CAT expression (mean ± s.e. of 4 independent experiments) is shown.
Open circles, +E2, +TIF2; squares, +E2, -TIF2; crosses, +OHT, +TIF2.

(c) TIF2 enhances transactivation mediated by some NR AF-2s, but not
that mediated by other transcription factors. Mean TIF2 stimulations of 3
independent experiments are given (variation < 13%). Ligands: lanes 3-4, E2;
lanes 7-8, DHT (dihydrotestosterone); lanes 11-12, R5020; lanes 15-16, T-RA.

(d) TIF2 enhances PR-mediated transcriptional activation from both a
minimal (GRE-TATA) and a complex (MMTV) promoter; this stimulation is
significantly greater in Cos-1 than in HeLa cells.

(e) TIF2 greatly enhances agonist-induced activation by ER in Cos-1 and
more weakly in HeLa cells. Note that the weak, seemingly ligand-independent,
TIF2 induction of ER (compare lanes 1 with 2 and 7 with 8) is due to residual
estradiol in the culture medium. In d and e, TIF2 inductions of ≥3 experiments
are shown (variation ≤10%).

Methods. With the exception of GAL-TIF2.1, TIF2.1 and TIF2, the
construction of reporter plasmids and expression vectors has been described
(Meyer, M-E. et al., Cell 57:433-442 (1989); Bocquel, M-T. et al., Nucl. Acids
Res. 17:2581-2595 (1989); Tasset, D. et al., Cell 62:1177-1187 (1990);
Gronemeyer, H. and Laudet, V., Protein Profile 2:1173-1308 (1995); Webster,
N.J. et al., Cell 54:199-207 (1988); Strahle, U. et al., EMBO J. 7:3389-3395
(1988); Seipel, K. et al., EMBO J. 11:4961-4968 (1992); Nagpal, S. et al., EMBO
J. 12:2349-2360 (1993); Chen, J-Y. et al., EMBO J. 14:1187-1197 (1995)). CAT
assays were performed as described (Bocquel, M-T. et al., Nucl. Acids Res.
17:2581-2595 (1989)).

(a) HeLa cells were cotransfected with 1 µg (17m)5-βG-CAT and 10
µg GAL(1-147) or 1, 3 and 10 µg GAL(1-147)-TIF2.1, respectively.

(b) HeLa cells were cotransfected with 5 µg Vit-tk-CAT and the
indicated amount of HEG0, with or without 5 µg TIF2.1 in the presence of 10⁻⁶
M E2 or OHT. CAT activity is given relative to that induced by 100 ng HEG0 in
presence of E2.

(c) HeLa cells were cotransfected with 1 µg 17m-tk-CAT and 1 µg of
the indicated GAL-fusion vectors with or without the addition of 3 µg TIF2
expression vector in presence or absence of 10"*M ligand.

(d) HeLa (lanes 1-12) or Cos-1 (lanes 13-18) cells were transfected
with 5 µg GRE-TATA-CAT (lanes 1-6) or 1 µg MMTV-CAT (lanes 7-18)
together with 1 µg hPR with or without 3 µg TIF2 in presence or absence of 1 0*
M of the indicated ligand.

(e) Cos-1 cells were cotransfected with 1 µg Vit-tk-CAT and 1 µg
HEG0 with or without 3 µg TIF2 in presence or absence of 10⁻⁶ M of the
indicated ligands.



Figure 6(a-b). Schematic representation of reporter genes (A) and
receptor expression vectors (B) (see the Materials and Methods section of Nagpal
et al., EMBO J. 12(6):2349-2360 (1993) for a detailed description of
construction). Sequences of mCRBPII (SEQ ID NO:11) and
mCRBPII(17m-ERE)/CAT (SEQ ID NO:11) are indicated. Minus and plus numbers are with
respect to the RNA start site (+1). In (B), the various regions (A-F) of wild-type
RARs and RXRs, as well as their truncation mutants, substitution mutants and
chimeric receptor constructs are schematically represented (not to scale) (see
Zelent et al., Nature 339:714-717 (1989); Leid et al., Trends Biochem. Sci.
17:427-433 (1992); Leid et al., Cell 68:377-395 (1992); Nagpal et al., Cell
70:1007-1019 (1992); and Allenby et al., Proc. Natl. Acad. Sci. USA 90:30-34
(1993)). Numbers indicate the amino acid positions in the wild-type receptor.
The positions of the amino acid substitutions are indicated with an arrow.

Figure 7(a-e). Mapping of TIF2 domains. 

(a) Schematic representation of functional domains identified in TIF2. 
The various TIF2 constructs are denoted; expressed residues are given in 
parentheses. Bold lines indicate expressed sequences. Constructs that score 
positive or negative for NR interaction, transactivation or CBP binding are 
identified on the right by "+" and "-" signs, respectively; nd, not determined.

(b) Mapping of the nuclear receptor-interacting domain of TIF2.
Glutathione-S-transferase (GST) pull-down experiments were performed with
35S-labeled in vitro-translated TIF2 polypeptides and bacterially produced GST,
GST-hERα(DEF) and GST-hRARα(DEF) in the presence or absence of 1 0"* M of the
cognate ligand (E2, estradiol for ER; RA, all-trans-retinoic acid for RAR).

(c) Analysis of the transcriptional activity of GAL-TIF2 fusion proteins.
Cos-1 and HeLa cells were transfected with 3 µg of plasmids expressing different
regions of TIF2 fused to the DNA-binding domain of the yeast transcription factor
GAL4 together with 1 µg of the (17m)5-G-CAT reporter plasmid. CAT assays
were performed as described (Bocquel, M.T. et al., Nucl. Acids Res.
17:2581-2595 (1989)). Quantitative data on CAT reporter expression were obtained either
by phosphoimager analysis (BAS2000, Fuji) of 14C-labeled CAT reaction products
separated by thin-layer chromatography, or using the CAT ELISA Kit (Boehringer
Mannheim). In all cases, CAT activities were normalized to the β-galactosidase
concentrations resulting from cotransfection of 1 µg of pCMVβGal (gift from
T. Lerouge) as internal control. Fold inductions above the GAL4 DBD value are
indicated. The mean and standard deviation of at least three experiments are
shown. A representative Western blot, illustrating the expression levels of the
GAL4-TIF2 fusion proteins, expressed from 10 µg of the corresponding
expression vectors, is shown on the left. The blot was revealed with mouse
monoclonal antibodies 2GV3 and 3GV2 specific for GAL4 DBD and 2GV4B7
specific for VP16 activation domain.
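
The normalization and fold-induction calculation described in (c) can be sketched as follows. This Python example is an illustration under stated assumptions, not the procedure actually used: the function names are hypothetical, the readings are invented placeholder numbers, and the sample standard deviation is assumed because the text does not say which estimator was applied. Each CAT value is divided by the β-galactosidase internal control from the same transfection, the result for a GAL-TIF2 fusion is divided by that for the GAL4 DBD alone, and the fold inductions are then averaged across experiments.

```python
from statistics import mean, stdev


def normalized_activity(cat: float, beta_gal: float) -> float:
    """CAT signal divided by the beta-galactosidase internal control."""
    if beta_gal <= 0:
        raise ValueError("beta-galactosidase control must be positive")
    return cat / beta_gal


def fold_induction(sample, baseline) -> float:
    """Fold induction of a fusion construct over the GAL4 DBD alone.

    sample and baseline are (cat, beta_gal) pairs from the same experiment.
    """
    return normalized_activity(*sample) / normalized_activity(*baseline)


def summarize(experiments):
    """Mean and sample standard deviation of fold inductions."""
    folds = [fold_induction(sample, baseline) for sample, baseline in experiments]
    return mean(folds), stdev(folds)


# Hypothetical readings, invented for illustration only:
# ((CAT, beta-gal) for the fusion, (CAT, beta-gal) for the GAL4 DBD alone)
experiments = [
    ((120.0, 2.0), (10.0, 2.0)),
    ((90.0, 1.5), (12.0, 2.0)),
    ((140.0, 2.0), (10.0, 2.0)),
]

mean_fold, sd_fold = summarize(experiments)
print(f"fold induction: {mean_fold:.1f} +/- {sd_fold:.1f}")
```

Dividing by the internal control before taking the ratio keeps differences in transfection efficiency between dishes from being read as differences in activation.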

(d) Mapping of the CBP-interacting domain of TIF2. GST pull-down
experiments were performed with 35S-labeled in vitro-translated TIF2
polypeptides and bacterially produced GST and GST-CBP (expressing CBP
residues 1872 to 2165).

(e) Two-hybrid analysis of the CBP-TIF2 interaction in mammalian cells
in vivo. HeLa cells were transfected with 0.2 µg of the GAL4 or GAL4-CBP
(expressing CBP residues 1872 to 2165) expression vectors together with 0.2 µg
of the VP16 or VP16-TIF2 expression vectors in the presence of 1 µg of
(17m)5-TATA-CAT reporter plasmid. Fold induction relative to the GAL-CBP activity
is indicated. The mean of three experiments is shown; in each case, values varied
by less than ±20%.

Figure 8(a-e). Mapping of the TIF2 nuclear receptor interacting domain
(NID).

(a) Alignment of the TIF2 NID (SEQ ID NO:2) with the corresponding
regions of SRC-1 (SEQ ID NO:3) and P/CIP (SEQ ID NO:5) and description of
NID mutations. The three conserved regions are displayed with the corresponding
amino acid numbers of hTIF2 or full-length hSRC-1 (F-SRC-1); the leucines
pertaining to the three NR box motifs (I, II, III) are boxed. The various deletion
and leucine-to-alanine point mutation constructs are denoted.

(b) Alignment of the TIF2 (SEQ ID NO:2) NR boxes with NR boxes
identified in several cofactors: TIF1α (SEQ ID NO:6), RIP140 (SEQ ID NO:7),
and TRIP3 (SEQ ID NO:8). The conserved leucines are boxed.

(c-d) Interaction of TIF2 NID mutants with NRs in vitro. GST affinity
chromatography experiments were carried out with 35S-labelled in vitro-translated
GAL4 DBD fusions of TIF2 deletion mutants (c) or TIF2.1 point mutants (d) and
bacterially expressed GST and GST fusions of the ER(DEF) and RAR(DEF) in
the absence or presence of 10« M estradiol or all-trans-retinoic acid, respectively.
For quantification of point mutant interactions, see below.

(e) Effect of TIF2 NID point mutations on stimulation of NR AF-2
activity. Cos-1 cells were cotransfected with 1 µg of the (17m)5-TATA-CAT
reporter, 0.2 µg of GAL-hERα(EF) or GAL-mRXRα(DE), and 2.5 µg of the
TIF2.1 wildtype or mutated fragments, as indicated. The reporter gene activation
relative to the TIF2.1 wildtype activity and in presence of 10⁻⁶ M estradiol (E2)
or all-trans-retinoic acid (RA), respectively, is indicated for each mutant (black
bars); for comparison, in vitro binding of the respective mutants relative to TIF2.1
wildtype binding in presence of ligand is indicated by the white bars. Each bar
represents the mean value obtained from at least three (interaction) or at least four
(transactivation) experiments, respectively; standard deviations are indicated.
Note that the absolute values for TIF2.1 wildtype activity varied by ±16% when
cotransfected with GAL-hERα(EF) and by ±34% when cotransfected with
GAL-mRXRα(DE). In the in vitro interaction assays, the affinity of the TIF2.1
wildtype standard varied by less than ±25%. Expression levels of TIF2 mutants
in the cells were verified by Western blot (not shown) with mouse monoclonal
antibody 3Ti3Fl, which is directed against an epitope outside the mutated area.

Figure 9(a-c). Mapping of the TIF2 activation function-1 (AF-1) and 
interaction of the AF-1 domain with CBP. 

(a) Alignment of the TIF2 AF-1 with the corresponding region of SRC-1 
(SEQ ID NO:3) and P/CIP (SEQ ID NO:9). Description of TIF2 AF-1 deletion 
mutants and their properties. The regions of TIF2 and hSRC-1 predicted to fold 
into α-helices are boxed (PHD program). GAL-TIF2 constructs that score 
positive or negative for transactivation of a GAL4 reporter are identified on the 
right by "+" and "-" signs, respectively; nd, not determined. 

(b) Transcriptional activation of TIF2 AF-1 mutants. Cos-1 and HeLa 
cells were cotransfected with 3 ug of plasmids expressing different mutants of the 
TIF2 AF-1 fused to the DNA-binding domain of the yeast transcription factor 
GAL4 together with 1 ug of the (17m)5-G-CAT reporter plasmid. Fold inductions 
above the activation seen with the GAL4 DBD alone are indicated. The values 
represent the mean of at least three experiments. Note that all GAL4-TIF2 fusion 
proteins were expressed to similar levels, as revealed by Western blot with 
antibodies directed against GAL4 DBD (data not shown). 

(c) Interaction of TIF2 AF-1 mutants with CBP in vitro. GST pull-down 
experiments were performed with ³⁵S-labeled in vitro-translated GAL-TIF2 fusion 
proteins and bacterially produced GST and GST-CBP. Note that the GAL4 DBD 
on its own does not interact with the GST-CBP affinity matrix. 

Figure 10(a-c). Identification of TIF2 AF-1 mutants which are impaired 
in both transcriptional activation and interaction with CBP. 

(a) Transcriptional activation by TIF2.13 and TIF2.13 mutants. Cos-1 and 
HeLa cells were cotransfected with 3 ug of plasmids expressing the TIF2.13 
region and the indicated TIF2.13 mutants fused to the DNA-binding domain of the 
yeast transcription factor GAL4 together with 1 ug of the (17m)5-G-CAT reporter 
plasmid. Fold inductions above the GAL4 DBD 1-fold value are indicated. The 
mean and standard deviation obtained from at least four experiments are shown. 
The expression levels of the GAL4-TIF2.13 fusion proteins were confirmed by 
Western blotting (data not shown). 

(b) Interaction of TIF2.13 wildtype and TIF2.13 mutants with CBP in 
mammalian cells revealed by two-hybrid analysis. HeLa cells were transfected 
with 0.2 ug of GAL4 or GAL-CBP expression vectors together with 0.2 ug of 
VP16 or VP16-TIF2.13 expression vectors in the presence of 1 ug of 
(17m)5-TATA-CAT reporter plasmid. Data are represented as fold induction over the 
activity seen with GAL-CBP alone. The mean and standard deviation obtained 
from ten experiments are shown. The expression levels were confirmed by 
Western blotting with antibodies directed against GAL4 DBD and VP16 AAD 
(data not shown). 

(c) Interaction of TIF2.13 wildtype and TIF2.13 mutants with CBP in 
vitro. GST pull-down experiments were performed with ³⁵S-labeled in vitro-translated 
VP16-TIF2.13 polypeptides and bacterially produced GST and GST-CBP. 
Note that the VP16 activation domain on its own does not interact with 
GST-CBP. 

Figure 11. The TIF2.1 coactivator fragment efficiently stimulates the 
ligand-dependent AF-2 of ER, RAR and RXR in yeast. No stimulatory effect of 
TIF2.1 on the isolated AF-1 of ER (HE15) is observable. Plasmids expressing 
different regions of hERα (white), hRARα (grey) and mRXRα (black) fused to 
the ER DBD (hERα(C)) were introduced into the yeast reporter strain PL3(α) 
together with TIF2.1 as indicated. White boxes represent sequences of 
transformants that were grown in the presence or absence of 10"* M of the cognate 
ligand (estradiol for ER, all-trans-retinoic acid for RAR, 9-cis-retinoic acid for 
RXR). OMP decase activities determined on each cell-free extract are expressed 
in nmol/min/mg protein; the mean and standard deviation of at least four 
experiments are shown. 

Figure 12(a-d). The isolated nuclear receptor-interacting domain (NID) 
of TIF2 acts dominant-negatively on the transcriptional activation by the ER, RXR 
and RAR LBDs. The mean value of induction obtained from the quantitation of 
at least three experiments (relative to the respective receptor LBD activity in 
absence of recombinant TIF2) is indicated below each panel. Expression levels 
of TIF2, TIF2.1 and TIF2.5 were routinely verified by Western blot with mouse 
monoclonal antibody 3Ti3C11 directed against a region of TIF2.5 (not shown). 

(a) Overexpression of the TIF2.5 fragment containing the isolated NID 
reverses the stimulatory effect of the potent coactivator fragment TIF2.1. Cos-1 
cells were cotransfected with 1 ug of the (17m)5-TATA-CAT reporter and 0.2 ug 
GAL-ERα(EF) expression vector in the presence or absence of 10⁻⁶ M estradiol. 
Where indicated, 0.1 ug of TIF2.1 and 2.5 ug of TIF2.5 expression vectors were 
cotransfected in addition. 

(b-d) Full-length TIF2 and the coactivator fragment TIF2.1 enhance, 
whereas the nuclear receptor interacting domain TIF2.5 blocks, the activity of the 
ER, RXR and RAR LBDs. Cos-1 and HeLa cells were cotransfected with 1 ug 
of the (17m)5-TATA-CAT reporter and 0.2 ug of the expression vector encoding 
the respective GAL DBD fusion of hERα(EF), mRXRα(DE) or mRARα(DEF), 
in the presence or absence of 10⁻⁶ M ligand (E2, estradiol; 9C-RA, 9-cis-retinoic 
acid; T-RA, all-trans-retinoic acid), together with 0.25 ug or 2.5 ug of TIF2, 
TIF2.1 and TIF2.5 expression vectors. 

Detailed Description of the Preferred Embodiments 

The present invention provides isolated nucleic acid molecules comprising 
a polynucleotide encoding transcriptional intermediary factor-2 (TIF2) whose 
amino acid sequence is shown in Figure 1 (SEQ ID NO:2). The TIF2 protein of 
the present invention shares sequence homology with the human steroid receptor 
coactivator SRC-1 (SEQ ID NO:3) (Figure 3). The nucleotide sequence shown 
in Figure 1 (SEQ ID NO:1) was obtained by sequencing a cDNA clone which was 
deposited on June 14, 1996 at the ATCC and given accession number 97612. 

Nucleic Acid Molecules 

In one embodiment of the present invention, isolated nucleic acid 
molecules are provided which encode the TIF2 protein. Sequence similarities 
between TIF2 and SRC-1 (Oñate et al., Science 270:1354 (1995)) indicate the 
existence of a novel gene family of NR transcriptional mediators. Using 
information provided herein, such as the nucleotide sequence in Figure 1 (SEQ ID 
NO:1) or the above-described deposited clone, a nucleic acid molecule of the 
present invention encoding a TIF2 polypeptide may be obtained using standard 
cloning and screening procedures. Illustrative of the invention, the nucleic acid 
molecule described in Figure 1 (SEQ ID NO:1) was discovered in a cDNA 
expression library from human placenta tissue. The TIF2 cDNA of the present 
invention encodes a protein of about 159 kDa (1,464 amino acids), which includes 
N-terminal nuclear localization signals (NLSs), one Gln- and three Ser/Thr-rich 
regions, and two charged clusters (Figure 3). TIF2 is widely expressed, since the 
corresponding transcript was found in several human tissues, including pancreas, 
kidney, muscle, liver, lung, placenta, brain and heart (Figure 2c). 

Isolated nucleic acids of the present invention may be in the form of RNA, 
such as mRNA, or in the form of DNA, including, for instance, cDNA and 
genomic DNA obtained by cloning or produced synthetically. The DNA may be 
double-stranded or single-stranded. Single-stranded DNA or RNA may be the 
coding strand, also known as the sense strand, or it may be the non-coding strand, 
also referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native environment. For 
example, recombinant DNA molecules contained in a vector are considered 
isolated for purposes of the present invention. Additional illustrative examples of 
isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells and purified (partially or substantially) DNA molecules in 
solution. Isolated RNA molecules include in vitro RNA transcripts of the DNA 
molecules of the present invention as well as partially or substantially purified 
mRNA molecules. Isolated nucleic acid molecules according to the present 
invention further include such molecules produced synthetically. 

Isolated nucleic acid molecules of the present invention include DNA 
molecules comprising an open reading frame (ORF) with an initiation codon at 
positions 163-165 of the nucleotide sequence shown in Figure 1 (SEQ ID NO:1); 
and DNA molecules which comprise a sequence substantially different from that 
described above but which, due to the degeneracy of the genetic code, still encode 
the TIF2 protein. Of course, the genetic code is well known in the art. Thus, it 
would be routine for one skilled in the art to generate the degenerate variants 
described above. 
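
The degeneracy argument above can be illustrated with a short sketch. The following Python snippet, offered only as an illustration, builds the standard genetic code and enumerates every DNA sequence that encodes a given peptide. The three-residue peptide `MEL` is a hypothetical toy input and is not taken from the TIF2 sequence; the function names are likewise assumptions of this sketch.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding strand, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def degenerate_variants(peptide):
    """Yield every DNA sequence that encodes the given peptide."""
    choices = [
        [codon for codon, aa in CODON_TABLE.items() if aa == residue]
        for residue in peptide
    ]
    for combo in product(*choices):
        yield "".join(combo)


# Hypothetical toy peptide (not from SEQ ID NO:2): Met-Glu-Leu.
variants = list(degenerate_variants("MEL"))
```

In this toy case one Met codon, two Glu codons and six Leu codons give 12 distinct sequences, all of which translate back to the same peptide.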

In another aspect, the invention provides isolated nucleic acid molecules 
encoding the TIF2 polypeptide having an amino acid sequence as encoded by the 
cDNA clone deposited as ATCC Deposit No. 97612 on June 14, 1996 (American 
Type Culture Collection, (ATCC) Rockville, MD). The invention further provides 
an isolated nucleic acid molecule having the nucleotide sequence shown in Figure 
1 (SEQ ID NO:1) or the nucleotide sequence of the TIF2 cDNA contained in the 
above-described clone, or a nucleic acid molecule having a sequence 
complementary to one of the above sequences. Such isolated nucleic acid 
molecules, preferably DNA molecules, are useful as probes for gene mapping by 
in situ hybridization with chromosomes and for detecting expression of the TIF2 
gene in human tissue, for instance, by Northern blot analysis. 

In another aspect, the invention provides an isolated nucleic acid molecule 
that hybridizes under stringent conditions to the above-described nucleic acid 
molecules. As used herein, "stringent conditions" is intended to mean, as a 
non-limiting example, overnight incubation at 42°C in a solution comprising 50% 
formamide, 5x SSC (1x SSC = 150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 ug/ml 
denatured, sheared salmon sperm DNA, followed by washing the filters in 
0.1x SSC at about 65°C. Preferably, such "an isolated nucleic acid molecule that 
hybridizes under stringent conditions" will be at least 15 bp, preferably at least 20 
bp, more preferably at least 30 bp, and most preferably, at least 50 bp in length. 

As used herein, "fragments" of an isolated DNA molecule having the 
nucleotide sequence of the deposited cDNA or the nucleotide sequence as shown 
in Figure 1 (SEQ ID NO:1) are intended to mean DNA fragments at least 15 bp, 
more preferably at least 20 bp, and most preferably at least 30 bp in length 
which are useful as diagnostic probes and primers as discussed above and in more 
detail below. Larger DNA fragments, up to, for example, 500 bp in length, are 
also useful as probes according to the present invention. A fragment of at least 
20 bp in length, for example, is intended to mean fragments which include 20 or 
more contiguous bases from the nucleotide sequence of the deposited cDNA or 
the nucleotide sequence as shown in Figure 1 (SEQ ID NO: 1). As indicated, such 
fragments are useful diagnostically either as a probe according to conventional 
DNA hybridization techniques or as primers for amplification of a target sequence 
by the polymerase chain reaction (PCR). 
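
The "20 or more contiguous bases" criterion above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions: sequences are plain uppercase strings, only the given strand is compared, and the 40-base `reference` below is a made-up placeholder rather than any part of SEQ ID NO:1. The function name `includes_contiguous_run` is hypothetical.

```python
def includes_contiguous_run(candidate, reference, min_len=20):
    """Return True if the candidate contains at least min_len contiguous
    bases that also occur contiguously in the reference sequence."""
    last_start = len(candidate) - min_len
    return any(
        candidate[i:i + min_len] in reference
        for i in range(last_start + 1)
    )


# Placeholder reference sequence (40 bases), for illustration only.
reference = "ACGT" * 10

# A 20-base stretch of the reference flanked by unrelated bases qualifies.
probe = "TTTT" + reference[3:23] + "GGGG"
# A 19-base stretch is one base short of the 20-base minimum.
too_short = reference[:19]
# A 25-base homopolymer shares no 20-base run with the reference.
unrelated = "A" * 25
```

Checking only fixed-length windows is sufficient here, because any shared run longer than `min_len` necessarily contains a shared run of exactly `min_len` bases.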

Since the gene has been deposited and the nucleotide sequence shown in 
Figure 1 (SEQ ID NO:1) is provided, generating such DNA fragments would be 
routine to the skilled worker in the relevant art. Restriction endonuclease 
cleavage or shearing by sonication, for example, may easily be used to generate 
fragments of various sizes. Alternatively, the DNA fragments of the present 
invention can be generated synthetically according to the methods and techniques 
known and available to those skilled in the art. Ten expressed sequence tags with 
homology to part of the TIF2 nucleotide sequence were identified by the inventors 
in GenBank: GenBank Accession numbers T77249, R77864, T77464, R77770, 
R08880, T85560, R25318, T85561, R08986 and R26517. 






The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode for fragments, analogs or 
derivatives of the TIF2 protein, e.g., polypeptides having biological activity 
substantially similar to the TIF2 protein. Variants may occur naturally, such as 
isoforms and allelic variants. Non-naturally occurring variants may be produced 
using any of the mutagenesis techniques known and available to those skilled in 
the art. 

Such variants include those produced by nucleotide substitutions, deletions 
or additions. The substitutions, deletions or additions may involve one or more 
nucleotides. The variants may be altered in coding or non-coding regions or both. 
Alterations in the coding regions may produce conservative or non-conservative 
amino acid substitutions, deletions or additions. Especially preferred among these 
are silent substitutions, additions and deletions, which do not alter the properties 
and activities of the TIF2 protein or fragment thereof. Also especially preferred 
in this regard are conservative substitutions. 

The present invention is further directed to isolated nucleic acid molecules 
that encode a cytoplasmic TIF2 polypeptide. Full-length TIF2 is a nuclear protein 
due to the presence of N-terminal nuclear localization signals (NLSs) (Figure 3). 
By a "cytoplasmic TIF2 polypeptide" is intended a TIF2 polypeptide that is 
essentially found in the cytoplasm after being recombinantly expressed in 
mammalian cells. Methods for generating nucleic acid molecules that encode a 
cytoplasmic TIF2 polypeptide include mutating or deleting the NLSs-coding 
N-terminal region of the nucleotide sequence shown in Figure 1 (SEQ ID NO: 1). 
Examples of NLS sequences include amino acids 13-20 and 31-39, encoded by nucleotides 
199-222 and 253-279 of Figure 1, respectively (see also Figure 3). Suitable mutations to the 
NLSs-coding N-terminal region include substitutions, deletions and insertions 
which result in a nucleic acid molecule that encodes a TIF2 polypeptide lacking 
the nuclear localization function. Methods for generating such mutations will be 
readily apparent to the skilled artisan and are described, for instance, in Molecular 
Cloning, A Laboratory Manual, 2nd edition, edited by Sambrook, J., Fritsch, E.F. 
and Maniatis, T., (1989), Cold Spring Harbor Laboratory Press. 
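
The correspondence between the amino acid positions and the nucleotide positions quoted above follows from the initiation codon at positions 163-165 of Figure 1. A minimal sketch of that arithmetic, assuming 1-based numbering and an uninterrupted open reading frame (the function name `codon_span` is hypothetical):

```python
ORF_START = 163  # first nucleotide of the initiation codon, per the text


def codon_span(first_aa, last_aa, orf_start=ORF_START):
    """Return the 1-based (start, end) nucleotide positions that encode
    amino acids first_aa..last_aa of an uninterrupted ORF."""
    start = orf_start + 3 * (first_aa - 1)  # first base of the first codon
    end = orf_start + 3 * last_aa - 1       # third base of the last codon
    return start, end


nls1 = codon_span(13, 20)  # first NLS, amino acids 13-20
nls2 = codon_span(31, 39)  # second NLS, amino acids 31-39
```

With these inputs the two spans come out as nucleotides 199-222 and 253-279, in agreement with the positions given in the text.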

Preferably, nucleic acid molecules encoding a cytoplasmic TIF2 
polypeptide will be fragments having a deletion in all or part of the N-terminal 
NLSs coding region. Methods for generating such fragments are described 
below. According to the present invention, such nucleic acid fragments further 
include N-terminal deletions extending beyond the NLSs coding region and may 
also include C-terminal deletions. For example, the present inventors have 
generated a nucleic acid molecule encoding the cytoplasmic TIF2.1 polypeptide 
(amino acids 624 to 1287 in Figures 1 and 3 (SEQ ID NO: 2)), which, like the 
nuclear full-length TIF2, interacts in an agonist-dependent manner with the nuclear 
receptors and enhances nuclear receptor-mediated transcriptional activation. The 
present inventors have also generated a nucleic acid molecule encoding the 
cytoplasmic TIF2.5 polypeptide (amino acids 624-869 in Figures 1 and 3 (SEQ 
ID NO:2)), which interacts with the NID domain of nuclear receptors, but does 
not enhance transcription. Also generated were nucleic acids encoding the 
cytoplasmic TIF2.8 and TIF2.12 polypeptides (amino acids 1010-1179 and amino 
acids 940-1131, respectively, in Figures 1 and 3 (SEQ ID NO:2)), which enhance 
transcription, but do not bind to nuclear receptors. Thus, by the invention, nucleic 
acid molecules are provided encoding cytoplasmic TIF2 polypeptides that interact 
in an agonist-dependent manner with nuclear receptors and enhance nuclear 
receptor-mediated transcriptional activation. Also provided are cytoplasmic TIF2 
polypeptides that bind to nuclear receptors, but do not enhance transcription, as 
are cytoplasmic TIF2 polypeptides that enhance transcription, but do not 
bind to nuclear receptors. As the skilled artisan will recognize, the length of such 
nucleic acid molecules can vary. 

Further embodiments of the invention include isolated nucleic acid 
molecules comprising a polynucleotide having a nucleotide sequence at least 90% 
identical, and more preferably at least 95%, 96%, 97%, 98%, or 99% identical to: 
(a) the nucleotide sequence of the cDNA deposited as ATCC 97612; (b) the 
nucleotide sequence shown in Figure 1 (SEQ ID NO:1); (c) the nucleotide 
sequence of the cDNA deposited as ATCC 97612 which encodes the full-length 
TIF2 protein; (d) the nucleotide sequence shown in Figure 1 (SEQ ID NO:1) 
which encodes the full-length TIF2 protein; (e) the nucleotide sequence of the 
cDNA deposited as ATCC 97612, which encodes the functional coactivator 
TIF2.1 protein; (f) the nucleotide sequence shown in Figure 1 (SEQ ID NO:1), 
which encodes the functional coactivator TIF2.1 protein; (g) the nucleotide 
sequence shown in Figure 1 (SEQ ID NO:1) which encodes the TIF2.0 
polypeptide; (h) the nucleotide sequence shown in Figure 1 (SEQ ID NO:1) which 
encodes the TIF2.2 polypeptide; (i) the nucleotide sequence shown in Figure 1 
(SEQ ID NO:1) which encodes the TIF2.3 polypeptide; (j) the nucleotide 
sequence shown in Figure 1 (SEQ ID NO:1) which encodes the TIF2.4 
polypeptide; (k) the nucleotide sequence shown in Figure 1 (SEQ ID NO:1) which 
encodes the TIF2.5 polypeptide; (l) the nucleotide sequence shown in Figure 1 
(SEQ ID NO:1) which encodes the TIF2.6 polypeptide; (m) the nucleotide 
sequence shown in Figure 1 (SEQ ID NO:1) which encodes the TIF2.7 
polypeptide; (n) the nucleotide sequence shown in Figure 1 (SEQ ID NO:1) 
which encodes the TIF2.8 polypeptide; (o) the nucleotide sequence shown in 
Figure 1 (SEQ ID NO:1) which encodes the TIF2.9 polypeptide; (p) the 
nucleotide sequence shown in Figure 1 (SEQ ID NO:1) which encodes the 
TIF2.10 polypeptide; (q) the nucleotide sequence shown in Figure 1 (SEQ ID 
NO:1) which encodes the TIF2.12 polypeptide; and (r) a nucleotide sequence 
complementary to any of the nucleotide sequences in (a)-(q). 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least 90%, 95%, 96%, 97%, 98%, or 99% "identical" can be determined 
conventionally using known computer algorithms such as the "fastA" program 
using, for example, the default parameters (Pearson and Lipman, Proc. Natl. 
Acad. Sci. USA 85:2444 (1988)). The present application is directed to such 
nucleic acid molecules having a nucleotide sequence at least 90%, 95%, 96%, 
97%, 98%, or 99% identical to the nucleotide sequence of the above-recited nucleic 
acid molecules irrespective of whether they encode a polypeptide having TIF2 
activity. This is because, even where a particular nucleic acid molecule does not 
encode a polypeptide having TIF2 activity, one of skill in the art would still know 
how to use the nucleic acid molecule as a probe. Uses of the nucleic acid 
molecules of the present invention that do not encode a polypeptide having TIF2 
activity include, inter alia, (1) isolating the TIF2 gene or allelic variants thereof 
in a cDNA library; (2) in situ hybridization (FISH) to metaphase chromosomal 
spreads to provide precise chromosomal location of the TIF2 gene as described 
in Verma et al., Human Chromosomes: a Manual of Basic Techniques, Pergamon 
Press, New York (1988); and (3) Northern blot analysis for detecting TIF2 mRNA 
expression in specific tissues, such as placenta tissue. 
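
To make the percent-identity thresholds above concrete, the following Python sketch computes identity over two sequences that are assumed to be already aligned and of equal length. It is deliberately simplified and is not the "fastA" algorithm, which also searches for the best local alignment and handles gaps. The function names and the 20-base toy sequences are assumptions made for illustration only.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions that match between two pre-aligned,
    equal-length sequences (no gap handling)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def meets_threshold(seq_a, seq_b, threshold=90.0):
    """True if the two sequences are at least `threshold` percent identical."""
    return percent_identity(seq_a, seq_b) >= threshold


# Toy 20-base sequences differing at a single position (the last base).
reference = "ACGTACGTACGTACGTACGT"
variant = "ACGTACGTACGTACGTACGA"
identity = percent_identity(reference, variant)
```

One mismatch in 20 aligned positions corresponds to 95% identity, so this toy pair would satisfy the 90% and 95% thresholds but not the 96% threshold.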

Preferred, however, are nucleic acid molecules having a nucleotide 
sequence at least 90%, and preferably at least 95%, 96%, 97%, 98%, or 99% 
identical to the nucleotide sequence of the above-described nucleic acid molecules 
which do, in fact, encode a polypeptide having at least one TIF2 protein activity. 
As used herein, "a polypeptide having a TIF2 protein activity" is intended to mean 
polypeptides exhibiting similar, but not necessarily identical, activity as at least one 
biological activity of the TIF2 protein as measured in a particular biological assay. 
For example, the TIF2 protein of the present invention interacts directly in an 
agonist-dependent manner with the ligand binding domains of several nuclear 
receptors. Moreover, when recombinantly expressed in mammalian cells, the TIF2 
protein of the present invention enhances transcription via CBP-dependent and 
CBP-independent routes. 

Thus, "a polypeptide having a TIF2 protein activity" includes polypeptides 
having one or more of the following activities: interaction with the LBD of one or 
more NRs in an agonist-dependent manner; enhancement of CBP-dependent 
transcriptional activation; or enhancement of CBP-independent transcriptional 
activation. 

Screening assays for determining whether a candidate polypeptide has 
TIF2 protein activity are described in detail in Examples 1, 3, 4, and 6 below. For 
example, by performing such assays, the present inventors have discovered that 
the functional coactivator fragment TIF2.1 (amino acids 624 to 1287 in Figures 
1 and 3 (SEQ ID NO: 2)) is "a polypeptide having a TIF2 protein activity." The 
present inventors have also discovered that the fragment TIF2.5 (amino acids 624- 
869) binds to the LBD of NRs without activating transcription, and is "a 
polypeptide having a TIF2 protein activity." Also discovered was the fragment 
TIF2.2 (amino acids 1288-1464 as shown in Figure 1 (SEQ ID NO:2)), which 
enhances CBP-independent transcription. Thus, TIF2.2 is "a polypeptide having 
a TIF2 protein activity." Another fragment discovered by the inventors, TIF2.8 
(amino acids 1010-1179 as shown in Figure 1 (SEQ ID NO:2)), is a "polypeptide 
having a TIF2 protein activity" as it activates CBP-dependent transcription. 

Due to the degeneracy of the genetic code, one of ordinary skill in the art 
will immediately recognize that a large number of the nucleic acid molecules 
having a nucleotide sequence at least 90%, preferably at least 95%, 96%, 97%, 
98%, or 99% identical to the nucleotide sequence of the above-described nucleic acid 
molecules will encode "a polypeptide having a TIF2 protein activity." In fact, 
since degenerate variants all encode the same polypeptide, this will be clear to the 
skilled artisan even without performing the above described screening assays. It 
will be further recognized by those skilled in the art that, for such nucleic acid 
molecules that are not degenerate variants, a reasonable number will also encode 
a polypeptide having a TIF2 protein activity. This is because the skilled artisan is 
fully aware of possible amino acid substitutions that are either less likely or not 
likely to significantly affect protein function (e.g., replacing one aliphatic amino 
acid with a second aliphatic amino acid). 

Guidance concerning how to make phenotypically silent amino acid 
substitutions is provided, for example, in J.U. Bowie et al., "Deciphering the 
Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 
247:1306-1310 (1990), wherein the authors indicate that there are two main 
approaches for studying the tolerance of an amino acid sequence to change. The 
first method relies on the process of evolution, in which mutations are either 
accepted or rejected by natural selection. The second approach uses genetic 
engineering to introduce amino acid changes at specific positions of a cloned gene 
and selections or screens to identify sequences that maintain functionality. As the 
authors state, these studies have revealed that proteins are surprisingly tolerant of 
amino acid substitutions. The authors further indicate which amino acid changes 
are likely to be permissive at a certain position of the protein. For example, most 
buried amino acid residues require nonpolar side chains, whereas few features of 
surface side chains are generally conserved. Other such phenotypically silent 
substitutions are described in Bowie et al., supra, and the references cited therein. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host cells which are genetically 
engineered with the recombinant vectors, and the production of TIF2 polypeptides 
or fragments thereof, such as TIF2.1, by recombinant techniques. 

Recombinant constructs may be introduced into host cells using well 
known techniques such as infection, transduction, transfection, transvection, 
electroporation and transformation. The vector may be, for example, a phage, 
plasmid, viral or retroviral vector. Retroviral vectors may be replication 
competent or replication defective. In the latter case, viral propagation generally 
will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in a 
precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

Preferred are vectors comprising cis-acting control regions to the 
polynucleotide of interest. Appropriate trans-acting factors may be supplied by the 
host, supplied by a complementing vector or supplied by the vector itself upon 
introduction into the host. 

In certain preferred embodiments in this regard, the vectors provide for 
specific expression, which may be inducible and/or cell type-specific. Particularly 
preferred among such vectors are those inducible by environmental factors that are 
easy to manipulate, such as temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-, 
episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids, 
bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as 
baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors derived from combinations 
thereof, such as cosmids and phagemids. 

The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
constructs will further contain sites for transcription initiation, termination and, in 
the transcribed region, a ribosome binding site for translation. The coding portion 
of the mature transcripts expressed by the constructs will preferably include a 
translation initiating AUG at the beginning and a termination codon (UAA, UGA 
or UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline or ampicillin resistance genes 
for culturing in E. coli and other bacteria. Representative examples of appropriate 
hosts include, but are not limited to, bacterial cells, such as E. coli, Streptomyces 
and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells such 
as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, Cos and 
Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Illustrative examples of vectors preferred for use in bacteria include, but 
are not limited to, pA2, pQE70, pQE60 and pQE-9, available from Qiagen; pBS 
vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16a, pNH18A, 
pNH46A, available from Stratagene; and ptrc99a, pKK223-3, pKK233-3, 
pDR540, pRIT5 available from Pharmacia. Preferred eukaryotic vectors include, 
but are not limited to, pWLNEO, pSV2CAT, pOG44, pXT1 and pSG available 
from Stratagene; and pSVK3, pBPV, pMSG and pSVL available from Pharmacia. 
Other suitable vectors will be readily apparent to the skilled artisan. 

Known bacterial promoters suitable for use in the present invention 
include the E. coli lacI and lacZ promoters, the T3 and T7 promoters, the gpt 
promoter, the lambda PR and PL promoters and the trp promoter. Suitable 
eukaryotic promoters include the CMV immediate early promoter, the HSV 
thymidine kinase promoter, the early and late SV40 promoters, the promoters of 
retroviral LTRs, such as those of the Rous sarcoma virus ("RSV"), and 
metallothionein promoters, such as the mouse metallothionein-I promoter. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid- 
mediated transfection, electroporation, transduction, infection or other methods. 
Such methods are described in many standard laboratory manuals, such as Davis 
et al., Basic Methods in Molecular Biology (1986). 

Transcription of the DNA encoding the polypeptides of the present 
invention by higher eukaryotes may be increased by inserting an enhancer 
sequence into the vector. Enhancers are cis-acting elements of DNA, generally 
about 10 to 300 bp in size, that act to increase transcriptional activity of a 
promoter in a given host cell-type. Illustrative examples of enhancers include, but 
are not limited to, the SV40 enhancer, which is located on the late side of the 
replication origin at bp 100 to 270, the cytomegalovirus early promoter enhancer, 
the polyoma enhancer on the late side of the replication origin, and adenovirus 
enhancers. 

For secretion of the translated protein into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the extracellular environment, 
appropriate secretion signals may be incorporated into the expressed polypeptide. 
The signals may be endogenous to the polypeptide or they may be heterologous 
signals. 

The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional 
heterologous functional regions. Thus, for instance, a region of additional amino 
acids, particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide moieties 
may be added to the polypeptide to facilitate purification. 

The TIF2 protein or fraction thereof can be recovered and purified from 
recombinant cell cultures by well-known methods including ammonium sulfate or 
ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography 
("HPLC") is employed for purification. 

Polypeptides of the present invention include, but are not limited to, 
naturally purified products, products of chemical synthetic procedures, and 
products produced by recombinant techniques from a prokaryotic or eukaryotic 
host, including, for example, bacterial, yeast, higher plant, insect and mammalian 
cells. Depending upon the host employed in a recombinant production procedure, 
the polypeptides of the present invention may be post-translationally modified
(e.g., glycosylated, phosphorylated, farnesylated, etc.). In addition, polypeptides 
of the invention may also include an initial modified methionine residue, in some 
cases as a result of host-mediated processes.

TIF2 Polypeptides and Fragments

The invention further provides an isolated TIF2 polypeptide having the 
amino acid sequence encoded by the deposited cDNA, or the amino acid sequence 
as shown in Figure 1 (SEQ ID NO:2), or a fragment thereof. Preferred
polypeptide fragments will have a TIF2 protein activity. In order for a TIF2 
polypeptide to interact in an agonist-dependent manner with nuclear receptors and 
to enhance nuclear receptor-mediated transcriptional activation, such TIF2 
polypeptide fragments should at least include amino acid residues 624 to 1131 as
shown in Figure 1 (SEQ ID NO:2) or amino acid substitutions, additions or
deletions thereof that are not significantly detrimental to the polypeptides' ability 
to interact in an agonist-dependent manner with nuclear receptors and to enhance 
nuclear receptor-mediated transcriptional activation. In order for a TIF2 
polypeptide fragment to interact with the LBD of an NR without activating 
transcription, the TIF2 polypeptide fragments should at least include amino acids 
624-869 as shown in Figure 1 (SEQ ID NO:2), or amino acid substitutions, 
additions or deletions thereof that are not significantly detrimental to the 
polypeptides' ability to interact with the LBD of an NR. In order for a TIF2 
polypeptide fragment to activate CBP-dependent transcription, the TIF2 
polypeptide should at least include amino acid residues 1010-1131 as shown in
Figure 1 (SEQ ID NO:2) or amino acid substitutions, additions or deletions 
thereof that are not significantly detrimental to the polypeptides' ability to activate 
CBP-dependent transcription. For a TIF2 polypeptide to activate CBP- 
independent transcription, the TIF2 polypeptide should at least include amino acid 
residues 1288-1464 as shown in Figure 1 (SEQ ID NO:2) or amino acid 
substitutions, additions or deletions thereof that are not significantly detrimental 
to the polypeptides' ability to activate CBP-independent transcription.
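
The minimal residue ranges recited above can be summarized in a small lookup. The following is only an illustrative sketch: the dictionary labels and the function name are hypothetical, and containing a listed region is treated as a necessary condition taken from the text, not as proof that a given fragment is active.

```python
# Minimal residue ranges (inclusive, numbered as in SEQ ID NO:2) taken from the
# paragraph above. The labels and the function name are illustrative only.
MIN_REGIONS = {
    "agonist-dependent NR interaction and transcriptional enhancement": (624, 1131),
    "NR LBD interaction without transcriptional activation": (624, 869),
    "CBP-dependent transcriptional activation": (1010, 1131),
    "CBP-independent transcriptional activation": (1288, 1464),
}


def regions_covered(start, end):
    """Return the activities whose minimal region lies wholly within start..end."""
    if start > end:
        raise ValueError("start must not exceed end")
    return [
        name
        for name, (low, high) in MIN_REGIONS.items()
        if start <= low and high <= end
    ]


if __name__ == "__main__":
    # A fragment spanning residues 624-869 contains only the LBD-interaction region.
    print(regions_covered(624, 869))
```

A fragment that does not contain a listed range would, under the statements above, not be expected to show the corresponding activity; actual activity would still have to be confirmed experimentally.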

Exemplary TIF2 polypeptide fragments according to the present invention 
include cytoplasmic TIF2 polypeptides having at least one mutation or deletion in
an N-terminal NLS region that interferes with the nuclear localization function.
Methods for generating cytoplasmic TIF2 polypeptides are described above. 

As used herein, an "isolated" polypeptide or protein is intended to mean 
a polypeptide or protein removed from its native environment, such as 
recombinantly produced polypeptides and proteins expressed in host cells and
native or recombinant polypeptides which have been substantially purified by any 
suitable technique (e.g., the single-step purification method disclosed in Smith and 
Johnson, Gene 67:31-40 (1988), which is incorporated by reference herein). 
Isolated polypeptides or proteins according to the present invention further 
include such compounds produced synthetically.

The present inventors have discovered that the full-length TIF2 protein 
is an about 1464 amino acid residue protein with a deduced molecular weight of 
about 160 kDa. It will be recognized by those skilled in the art that some amino 
acid sequence of the TIF2 protein can be varied without significant effect on the 
structure or function of the protein. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the 
protein which determine activity, such as the region described above which has 
been determined by the inventors as being critical to the protein's ability to 
enhance nuclear receptor-mediated transcriptional activation. In general, it is 
often possible to replace residues which form the tertiary structure, provided that 
residues performing a similar function are used. In other instances, the type of 
residue may be completely unimportant if the alteration occurs at a non-critical 
region of the protein. 

Thus, the present invention further includes variations of the TIF2 
polypeptide which show substantial TIF2 polypeptide activity or which include 
regions of TIF2 protein such as the protein fragments discussed below. Such 
mutants include deletions, insertions, inversions, repeats, and type substitutions 
(for example, substituting one hydrophilic residue for another, but not strongly 
hydrophilic for strongly hydrophobic as a rule). Small changes or such "neutral" 
amino acid substitutions will generally have little effect on activity.

Typically seen as conservative substitutions are the replacements, one for
another, among the aliphatic amino acids Ala, Val, Leu and Ile; interchange of the
hydroxyl residues Ser and Thr; exchange of the acidic residues Asp and Glu;
substitution between the amide residues Asn and Gln; exchange of the basic
residues Lys and Arg; and replacements among the aromatic residues Phe and Tyr.
As indicated in detail above, further guidance concerning which amino acid
changes are likely to be phenotypically silent (i.e., not likely to have a significant
deleterious effect on a function) can be found in Bowie et al., "Deciphering the
Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science
247:1306-1310 (1990).
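
The substitution classes listed above can be expressed as sets of one-letter amino acid codes. The sketch below is illustrative only; the names `CONSERVATIVE_GROUPS` and `is_conservative` are hypothetical, and the check reflects nothing beyond the six classes named in the preceding paragraph.

```python
# Conservative substitution classes named in the text, in one-letter codes.
CONSERVATIVE_GROUPS = [
    {"A", "V", "L", "I"},  # aliphatic: Ala, Val, Leu, Ile
    {"S", "T"},            # hydroxyl: Ser, Thr
    {"D", "E"},            # acidic: Asp, Glu
    {"N", "Q"},            # amide: Asn, Gln
    {"K", "R"},            # basic: Lys, Arg
    {"F", "Y"},            # aromatic: Phe, Tyr
]


def is_conservative(original, replacement):
    """True if the two residues differ and fall in the same class listed above."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


if __name__ == "__main__":
    print(is_conservative("L", "I"))  # Leu -> Ile, both aliphatic
    print(is_conservative("L", "D"))  # Leu -> Asp, different classes
```

Residues not named in the text (for example Gly, Pro, Cys, Met, His, Trp) belong to no group here, so any substitution involving them is reported as non-conservative by this sketch.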

The polypeptides of the present invention include polypeptides having an
amino acid sequence as encoded by the deposited cDNA, an amino acid sequence
as shown in SEQ ID NO:2, as well as an amino acid sequence at least 80%
identical, more preferably at least 90% identical, and most preferably at least 95%,
96%, 97%, 98%, or 99% identical, to the amino acid sequence encoded by the
deposited cDNA, to the amino acid sequence as shown in SEQ ID NO:2 or to the
amino acid sequence of a polypeptide fragment described above. Whether two
polypeptides have an amino acid sequence that is at least 80%, 90% or 95%
identical can be determined using a computer algorithm as described above.
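
As a rough illustration of what a percent-identity figure means, the sketch below counts matching positions in two sequences that have already been aligned. It is not the algorithm referred to above: it assumes the alignment (with "-" for gaps) was produced by some other program, and the choice of denominator (all alignment columns) is an assumption, since alignment tools differ on this point. The function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must be pre-aligned strings of equal length, with '-' marking gaps.
    Columns in which both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Nine of ten aligned positions match.
    print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
```

A result of at least 80, 90 or 95 from such a calculation would correspond to the identity thresholds recited above, subject to the alignment method actually used.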

As described in detail below, the nucleic acid molecules and polypeptides
of the present invention are useful in screening assays for identifying agonists and
antagonists of NR AF-2-mediated transactivation. For example, in Halachmi, S.
et al., Science 264:1455 (1994), the authors show that tamoxifen, which has
growth inhibitory effects in breast cancer, disrupts the formation of a complex that
includes the estrogen receptor and ERAP160. Accordingly, the nucleic acid
molecules and polypeptides of the present invention are useful in assays for
identifying drugs capable of enhancing or inhibiting nuclear receptor-mediated
pathways.

The nucleic acid molecules and polypeptides of the present invention are
useful in screening assays for identifying agonists and antagonists of TIF2 AD1
activity, and in screening assays for identifying agonists and antagonists of TIF2
AD2 activity, as described in detail below.

Screening Methods 

Nuclear receptors (NRs) are members of a superfamily of ligand-inducible 
transcriptional regulatory factors that include steroid hormone, thyroid hormone, 
vitamin D3 and retinoid receptors (Leid, M. et al., Trends Biochem. Sci. 17:427-
433 (1992); Leid, M. et al., Cell 68:377-395 (1992); and Linney, E., Curr. Top.
Dev. Biol. 27:309-350 (1992)). NRs exhibit a modular structure which reflects
the existence of several autonomous functional domains. Based on amino acid 
sequence similarity between the chicken estrogen receptor, the human estrogen 
and glucocorticoid receptors, and the v-erb-A oncogene, Krust, A. et al., EMBO
J. 5:891-897 (1986), defined six regions, A, B, C, D, E and F (see Figure 6),
which display different degrees of evolutionary conservation amongst various 
members of the nuclear receptor superfamily. The highly conserved region C 
contains two zinc fingers and corresponds to the core of the DNA-binding domain 
(DBD), which is responsible for specific recognition of the cognate response 
elements. Region E is functionally complex, since in addition to the ligand-binding 
domain (LBD), it contains a ligand-dependent activation function (AF-2) and a 
dimerization interface. An autonomous transcriptional activation function (AF-1) 
is present in the non-conserved N-terminal A/B regions of the steroid receptors. 
Interestingly, both AF-1 and AF-2 of steroid receptors exhibit differential
transcriptional activation properties which appear to be both cell type and 
promoter context specific (Gronemeyer, H. Annu. Rev. Genet. 25:89-123 (1991)). 

The all-trans (T-RA) and 9-cis (9C-RA) retinoic acid signals are
transduced by two families of nuclear receptors: RAR α, β and γ (and their
isoforms) are activated by both T-RA and 9C-RA, whereas RXR α, β and γ are
exclusively activated by 9C-RA (Allenby, G. et al., Proc. Natl. Acad. Sci. USA
90:30-34 (1993)). The three RAR types differ in their B regions, and their main
isoforms (α1 and α2, β1-4, and γ1 and γ2) have different N-terminal A regions
(Leid, M. et al., Trends Biochem. Sci. 17:427-433 (1992)). Similarly, the RXR
types differ in their A/B regions (Mangelsdorf, D.J. et al., Genes Dev. 6:329-344
(1992)).

The E-region of RARs and RXRs has also been shown to contain a
dimerization interface (Yu, V.C. et al., Curr. Opin. Biotechnol. 3:597-602
(1992)). Most interestingly, it was demonstrated that RAR/RXR heterodimers
bind much more efficiently in vitro than homodimers of either receptor to a
number of RA response elements (RAREs) (Yu, V.C. et al., Cell 67:1251-1266
(1991); Berrodin, T. J. et al., Mol. Endocrinol. 6:1468-1478 (1992); Bugge, T. H.
et al., EMBO J. 11:1409-1418 (1992); Hall, R. K. et al., Mol. Cell. Biol.
12:5527-5535 (1992); Hallenbeck, P. L. et al., Proc. Natl. Acad. Sci. USA 89:5572-
5576 (1992); Husmann, M. et al., Biochem. Biophys. Res. Commun. 187:1558-
1564 (1992); Kliewer, S.A. et al., Nature 355:446-449 (1992b); Leid, M. et al.,
Cell 68:377-395 (1992); Marks, M. S. et al., EMBO J. 11:1419-1435 (1992);
Zhang, X. K. et al., Nature 355:441-446 (1992)). RAR and RXR heterodimers
are also preferentially formed in solution in vitro (Yu, V.C. et al., Cell 67:1251-
1266 (1991); Leid, M. et al., Cell 68:377-395 (1992); Marks, M. S. et al., EMBO
J. 11:1419-1435 (1992)), although the addition of 9C-RA appears to enhance the
formation of RXR homodimers in vitro (Lehman, J. M. et al., Science 258:1944-
1946 (1992); Zhang, X. K. et al., Nature 358:587-591 (1992b)). That RAR-RXR
heterodimers, rather than the corresponding homodimers, preferentially bind to
RAREs in cultured cells in vivo has been strongly supported by experiments
described in Durand, B. et al., Cell 71:73-85 (1992).

As retinoic acid is known to regulate the proliferative and differentiative
capacities of several mammalian cell types (Gudas, L.J. et al. (1994) in Sporn,
M.B., Roberts, A.B. and Goodman, D.S. (eds), The Retinoids, 2nd edition, Raven
Press, New York, pp. 443-520), retinoids are used in a variety of chemopreventive
and chemotherapeutic settings. The prevention of oral, skin and head and neck
cancers in patients at risk for these tumors has been reported (Hong, W. K. et al.,
N. Engl. J. Med. 315:1501-1505 (1986); Hong, W. K. et al., N. Engl. J. Med.
323:795-801 (1990); Kraemer, K. H. et al., N. Engl. J. Med. 318:1633-1637
(1988); Bollag, W. et al., Ann. Oncol. 3:513-526 (1992); Chiesa, F. et al., Eur.
J. Cancer B Oral Oncol. 28B:97-102 (1992); Costa, A. et al., Cancer Res.
54:Suppl. 7, 2032-2037 (1994)), and retinoids are used in the therapy of acute
promyelocytic leukemia (Huang, M.E. et al., Blood 72:567-572 (1988);
Castaigne, S. et al., Blood 76:1704-1709 (1990); Chomienne, C. et al.,
Blood 76:1710-1717 (1990); Chomienne, C. et al., J. Clin. Invest. 88:2150-2154
(1991); Chen, Z. et al., Leukemia 5:288-292 (1991); Lo Coco, F. et al.,
Blood 77:1657-1659 (1991); Warrell, R. P., Jr. et al., N. Engl. J. Med. 324:1385-
1393 (1991)), squamous cell carcinoma of the cervix and the skin (Verma, A. K.,
Cancer Res. 47:5097-5101 (1987); Lippman, S. M. et al., J. Natl. Cancer
Inst. 84:235-241 (1992); Lippman, S. M. et al., J. Natl. Cancer Inst. 84:241-245
(1992)) and Kaposi sarcoma (Bonhomme, L. et al., Ann. Oncol. 2:234-235
(1991)).

For example, in Chen, J-Y. et al., EMBO J. 14(6):1187-1197 (1995), a
number of dissociating synthetic retinoids are characterized that selectively induce
the AF-2 activation function present in the LBD of RARβ (βAF-2). The authors also
report that these synthetic retinoids, like RA, can inhibit the anchorage-
independent growth of oncogene-transformed 3T3 cells. Further, the promoter
of the human interleukin-6 (IL-6) gene, whose product is involved in the
regulation of hematopoiesis, immune responses and inflammation (Kishimoto, T.
et al., Science 258:593-597 (1992)), is induced by RA but not by the 'dissociating'
retinoids, which repressed its activity.

In addition to the retinoid receptors, compounds with agonistic and 
antagonistic properties on functions of the steroid receptors have also been 
reported. For example, in Meyer, M-E. et al., EMBO J. 9(12):3923-3932 (1990),
a transient expression/gel retardation system was used to study the effects of
RU486 and R5020 on glucocorticoid and progesterone receptor function.
Further, in Halachmi, S. et al., Science 264:1455-1458 (1994), tamoxifen is
shown to competitively inhibit estradiol-induced ERAP160 binding to the estrogen
receptor, suggesting a mechanism for its growth-inhibitory effects in breast cancer.
Accordingly, due to their clinical importance, there is great interest in developing
screening methods capable of identifying agonists and antagonists of nuclear
receptor transactivation.

As indicated, the present inventors have cloned a gene encoding TIF2 and 
have shown that TIF2 and a cytoplasmic fragment thereof bind, in an agonist- 
dependent manner, to all nuclear receptors analyzed: RAR, RXR, ER, TR,
VDR, GR and AR. Further, the present inventors have shown that TIF2 
polypeptides are transcriptional mediators of the nuclear receptor AF-2. Thus, the 
present invention further provides a screening method for identifying a nuclear 
receptor (NR) antagonist, which involves: (a) providing a host cell containing 
recombinant genes which express a polypeptide comprising a NR ligand binding 
domain (LBD) and a polypeptide comprising transcriptional intermediary factor-2 
(TIF-2) or a TIF-2-fragment, wherein, in the presence of an agonist, said TIF-2 
and said TIF-2-fragment bind said NR LBD; (b) administering a candidate 
antagonist to said cell; and (c) determining whether said candidate antagonist 
reduces either: (1) TIF-2- or TIF-2-fragment-binding to the AF-2 of said NR LBD 
as compared to said binding in the absence of said candidate antagonist; or (2) 
TIF-2- or TIF-2-fragment-stimulated NR LBD AF-2-mediated transactivation as 
compared to said transactivation in the absence of said candidate antagonist. 

In a further aspect, a screening method is provided for identifying a nuclear 
receptor (NR) agonist, which involves: (a) providing a host cell containing 
recombinant genes which express a polypeptide comprising a NR ligand binding 
domain (LBD) and a polypeptide comprising transcriptional intermediary factor-2 
(TIF-2) or a TIF-2-fragment, wherein, in the presence of an agonist, said TIF-2
and said TIF-2-fragment bind said NR LBD; (b) administering a candidate agonist 
to said cell; and (c) determining whether said candidate agonist enhances either: 
(1) TIF-2- or TIF-2-fragment-binding to the AF-2 of said NR LBD as compared 
to said binding in the absence of said candidate agonist; or (2) TIF-2- or TIF-2-
fragment-stimulated NR LBD AF-2-mediated transactivation as compared to said
transactivation in the absence of said candidate agonist. 
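
Step (c) of both screening methods is a comparison of a measured signal (binding or reporter activity) with and without the candidate compound. A minimal sketch of that comparison follows; the function name, the labels it returns and the two-fold default cutoff are all hypothetical choices made for illustration and are not taken from this specification.

```python
def classify_candidate(signal_without, signal_with, min_fold_change=2.0):
    """Compare a binding or reporter signal with and without a candidate compound.

    Returns "enhances" (agonist-like), "reduces" (antagonist-like) or
    "no clear effect". The cutoff min_fold_change is an arbitrary illustrative
    value; a real assay would set it from replicate variability and controls.
    """
    if signal_without <= 0 or signal_with < 0:
        raise ValueError("signals must be positive measurements")
    if min_fold_change <= 1:
        raise ValueError("min_fold_change must be greater than 1")
    fold = signal_with / signal_without
    if fold >= min_fold_change:
        return "enhances"
    if fold <= 1.0 / min_fold_change:
        return "reduces"
    return "no clear effect"


if __name__ == "__main__":
    # Hypothetical reporter readings in arbitrary units.
    print(classify_candidate(100.0, 20.0))
    print(classify_candidate(100.0, 300.0))
```

In the antagonist screen the baseline signal would be the one measured under conditions in which TIF-2 binds the NR LBD, so that a reduction can be observed.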

By "a host cell containing recombinant genes" is intended host cells into
which one or more of the recombinant constructs described herein have been 
introduced or a progeny of such host cells. 

Candidate antagonists and agonists according to the present invention
include 'dissociating' ligands for nuclear receptors such as those described in Chen
et al., EMBO J. 14:1187-1197 (1995) and Ostrowski et al., Proc. Natl. Acad. Sci.
USA 92:1812-1816 (1995). Progesterone and glucocorticoid receptor agonists and
antagonists are described in Meyer et al., EMBO J. 9(12):3923-3932 (1990). An
estrogen receptor antagonist is described in Halachmi et al., Science 264:1455-
1458 (1994). Thus, methods are known in the art for developing candidate
nuclear receptor agonists and antagonists for screening according to the present
invention. For example, the crystal structures of the ligand binding domains of
certain nuclear receptors have been described. In particular, the crystal structure
of the RXR LBD is described in Bourguet et al., Nature 375:377-382 (1995); the
crystal structure of the RAR LBD is described in Renaud et al., Nature 378:681-
689 (1995); and the crystal structure of a thyroid hormone receptor is described
in Wagner et al., Nature 378:690-697 (1995). Using information from the crystal
structure of a nuclear receptor, computer programs are available for designing the
structure of candidate agonists and antagonists which would likely bind to the ligand
binding domain. Suitable computer program packages for this purpose include
WHAT IF (Vriend, G., J. Mol. Graphics 8:52-56 (1990)), and GRID (Goodford,
J. Med. Chem. 28:849-857 (1985)).

Recombinant genes encoding a polypeptide comprising TIF2 or a TIF2-
fragment capable of binding nuclear receptors in an agonist-dependent manner are
described above. Recombinant genes encoding a polypeptide comprising a NR 
LBD have been described in great detail in the art. Methods for determining 
whether a candidate agonist or antagonist enhances or interferes with TIF-2 or 
TIF-2-fragment binding to a NR are known in the art. For example, the effect of 
a candidate agonist or antagonist on TIF2- or TIF-2-fragment-binding to a NR
LBD can be studied using glutathione-S-transferase (GST) interaction assays by
tagging NR LBDs with GST as described in detail in the Experimental section
below and in Le Douarin et al., EMBO J. 14:2020-2033 (1995).

Where the effect of a candidate agonist or antagonist on NR AF-2
transactivation is to be assayed, preferably, the recombinant genes will encode a
chimeric polypeptide comprising a NR LBD fused to a DNA binding domain from
a transactivator protein. In a further preferred embodiment, the host cell
expressing the recombinant genes will also express a reporter gene. For example,
in Chen et al., EMBO J. 14(6):1187-1197 (1995), three 'reporter' cell lines have
been established in which RARα, RARβ, or RARγ agonists induce luciferase
activity that can be measured in the intact cells using a single-photon-counting
camera. These cell lines stably express chimeric proteins containing the DNA
binding domain of the yeast transactivator GAL4 fused to the EF regions (which
contain the LBD and the AF-2 activation function) of RARα (GAL-RARα),
RARβ (GAL-RARβ) or RARγ (GAL-RARγ), and a luciferase reporter gene
driven by a pentamer of the GAL4 recognition sequence ('17m') in front of the β-
globin promoter (17mx5-G-Luc). This reporter system is insensitive to
endogenous receptors which cannot recognize the GAL4 binding site. Further
examples of reporter genes and reporter expression vectors for use according to
the present invention to screen candidate agonists and antagonists of retinoid
receptors are provided in Figure 6.

The ER expression vectors HEO, HE19 and HE15, the GR expression
vectors HG1 and HG3 and the PR expression vectors cPR1 and cPR3 are described in
Kumar et al., Cell 51:941-951 (1987) and Gronemeyer et al., EMBO J. 6:3985-
3994 (1987). The GR expression vector HG8 and the PR expression vector
cPR5A are described in Bocquel et al., Nucl. Acids Res. 17:2581-2595 (1989).
Reporter genes for the above described ER, GR and PR expression vectors
include MMTV-CAT (in the case of PR and GR; Cato et al., EMBO J. 5:2237-
2240 (1986)) and vit-tk-CAT (in the case of ER; Klein-Hitpass et al., Cell
46:1053-1061 (1986)).

The TR expression vector LexA-TR is described in Lee et al., Nature
374:91-94 (1995), which also describes using the yeast two-hybrid system to
identify compounds that affect TR transactivation.

Still further references disclosing reporter plasmids containing a reporter
gene and expression vectors encoding a NR LBD include Meyer et al., Cell
57:433-442 (1989); Meyer et al., EMBO J. 9(12):3923-3932 (1990); Tasset et
al., Cell 62:1177-1187 (1990); Gronemeyer, H. and Laudet, V., Protein Profile
2:1173-1308 (1995); Webster et al., Cell 54:199-207 (1988); Strahle et al.,
EMBO J. 7:3389-3395 (1988); Seipel et al., EMBO J. 11:4961-4968 (1992); and
Nagpal et al., EMBO J. 12:2349-2360 (1993). In a particularly preferred
embodiment, the effect of a candidate agonist or antagonist on NR AF-2-mediated 
transactivation is assayed according to the method described in the legend to 
Figure 5 above. 

The present inventors have identified an activation domain of TIF2, AD1
(amino acids 1010-1131 as shown in Figure 1 (SEQ ID NO:2)), which mediates
the CBP-dependent transcriptional activation function of TIF2. Further, the
present inventors have shown that polypeptides containing this activation domain,
when fused to a DNA-binding domain of a transcriptional activator, are capable of
activating transcription via a CBP-dependent pathway. Accordingly, the present
invention further provides a screening method for identifying an agonist of TIF2
AD1 activation domain activity, which involves: (a) providing a host cell
containing a recombinant gene or genes which express a polypeptide comprising
a transcriptional activator DNA-binding domain (DBD) and a TIF-2 AD1
activation domain; (b) administering a candidate agonist to said cell; and (c)
determining whether said candidate agonist enhances TIF2 AD1 activation domain
activity.

The invention further provides for a screening method for identifying an
antagonist of TIF2 AD1 activation domain activity, which comprises: (a)
providing a host cell containing a recombinant gene or genes which express a
polypeptide comprising a transcriptional activator DNA-binding domain (DBD)
and a TIF-2 AD1 activation domain; (b) administering a candidate antagonist to
said cell; and (c) determining whether said candidate antagonist inhibits TIF2 AD1
activation domain activity.

By "transcriptional activator" it is meant molecules that enhance the 
initiation of transcription by RNA polymerase B (II). Transcriptional activators 
include yeast transcriptional activators, such as GAL4 and GCN4; the herpes 
simplex activator, VP16; and members of the nuclear receptor family, which 
includes RAR, RXR, ER, TR, VDR, GR, and AR. 

Recombinant genes encoding a polypeptide comprising a TIF2 AD1
activation domain are described below. Recombinant genes encoding a 
polypeptide comprising a transcriptional activator DBD are well known in the art. 
Methods for determining whether a candidate agonist or antagonist enhances or 
interferes with transcription are well known in the art. For example, the effect of 
a candidate agonist or antagonist of TIF2 AD1 activation domain activity can be
determined using CAT assays as described below and in Gronemeyer et al. (1987) 
and Bocquel et al., Nucl. Acids Res. (1989). 

Where the effect of a candidate agonist or antagonist of TIF2 AD1
activation domain activity is to be determined, preferably, recombinant genes will
encode a chimeric polypeptide comprising a transcriptional activator DBD fused
to a TIF2 polypeptide comprising the AD1 activation domain. In a further
embodiment, the host cell expressing the recombinant genes will also express a
reporter gene. Examples of reporter genes are described above. In a particularly
preferred embodiment, the effect of a candidate agonist or antagonist of TIF2
AD1 activation domain function will be determined as described in the legend to
Figure 7(c).

The present inventors have also identified a second activation domain of 
TIF2, AD2 (amino acids 1288-1464 as shown in Figure 1 (SEQ ID NO:2)), which 
mediates CBP-independent transcriptional activation. Further, the present 
inventors have shown that polypeptides containing this activation domain, when
fused to a DNA-binding domain of a transcriptional activator, are capable of 
activating transcription via a CBP-independent pathway. Accordingly, the present 
invention further provides a screening method for identifying an agonist of TIF2 
AD2 activation domain activity, which comprises: (a) providing a host cell 
containing a recombinant gene or genes which express a polypeptide comprising 
a transcriptional activator DNA-binding domain (DBD) and a TIF-2 AD2 
activation domain; (b) administering a candidate agonist to said cell; and (c) 
determining whether said candidate agonist enhances TIF2 AD2 activation domain 
activity. 

The invention further provides for a screening method for identifying an 
antagonist of TIF2 AD2 activation domain activity, which comprises: (a) 
providing a host cell containing a recombinant gene or genes which express a 
polypeptide comprising a transcriptional activator DNA-binding domain (DBD) 
and a TIF-2 AD2 activation domain; (b) administering a candidate antagonist to 
said cell; and (c) determining whether said candidate antagonist inhibits TIF2 AD2 
activation domain activity. 

Recombinant genes encoding a polypeptide comprising a TIF2 AD2 
activation domain are described below. Recombinant genes encoding a 
polypeptide comprising a transcriptional activator DBD are well known in the art. 
Methods for determining whether a candidate agonist or antagonist enhances or 
interferes with transcription are known in the art. 

Where the effect of a candidate agonist or antagonist of TIF2 AD2
activation domain activity is to be determined, preferably, recombinant genes will 
encode a chimeric polypeptide comprising a transcriptional activator DBD fused 
to a TIF2 polypeptide comprising the AD2 activation domain. Transcriptional 
activators are described above. In a further embodiment, the host cell expressing 
the recombinant genes will also express a reporter gene. Examples of reporter 
genes are described above. In a particularly preferred embodiment, the effect of 
a candidate agonist or antagonist of TIF2 AD2 activation domain activity will be
determined as described in the legend to Figure 7(c). 



TIF-2 Antibodies and Methods 

TIF2 antibodies are also provided by the present invention, as specific for 
a TIF2 protein, a TIF2 polypeptide, a TIF2 protein fragment or a TIF2 
polypeptide fragment. The term "antibody" is meant to include polyclonal 
antibodies, monoclonal antibodies (mAbs), chimeric antibodies, anti-idiotypic 
(anti-Id) antibodies to antibodies that can be labeled in soluble or bound form, as 
well as fragments thereof provided by any known technique, such as, but not 
limited to enzymatic cleavage, peptide synthesis or recombinant techniques. 
Polyclonal antibodies are heterogeneous populations of antibody molecules 
derived from the sera of animals immunized with an antigen. A monoclonal 
antibody contains a substantially homogeneous population of antibodies specific 
to antigens, which population contains substantially similar epitope binding sites. 
MAbs may be obtained by methods known to those skilled in the art. See, for 
example Kohler and Milstein, Nature 256:495-497 (1975); U.S. Patent No. 
4,376,110; Ausubel et al, eds., CURRENT PROTOCOLS IN MOLECULAR 
BIOLOGY, Greene Publishing Assoc. and Wiley Interscience, N. Y., (1987-1996); 
and Harlow and Lane ANTIBODIES: A LABORATORY MANUAL Cold Spring 
Harbor Laboratory (1988); Colligan et al., eds., Current Protocols in 
Immunology, Greene Publishing Assoc. and Wiley Interscience, N.Y., (1992- 
1996), the contents of which references are incorporated entirely herein by 
reference. 

Such antibodies may be of any immunoglobulin class including IgG, IgM,
IgE, IgA, IgD and any subclass thereof. A hybridoma producing a mAb of the
present invention may be cultivated in vitro, in situ or in vivo. Production of high
titers of mAbs in vivo or in situ makes this the presently preferred method of
production.

Chimeric antibodies are molecules different portions of which are derived 
from different animal species, such as those having a variable region derived from
a murine mAb and a human immunoglobulin constant region, which are primarily 
used to reduce immunogenicity in application and to increase yields in production, 
for example, where murine mAbs have higher yields from hybridomas but higher 
immunogenicity in humans, such that human/murine chimeric mAbs are used. 
Chimeric antibodies and methods for their production are known in the art (Cabilly
et al., Proc. Natl. Acad. Sci. USA 81:3273-3277 (1984); European Patent
Application 125023 (published November 14, 1984); Neuberger et al., Nature
314:268-270 (1985); Taniguchi et al., European Patent Application 171496
(1985); Morrison et al., European Patent Application 173494 (1986); Neuberger
et al., PCT Application WO 86/01533 (1986); Kudo et al., European Patent
Application 184187 (1986); Robinson et al., PCT Publication PCT/US86/02269
(1987); Liu et al., Proc. Natl. Acad. Sci. USA 84:3439-3443 (1987); Sun et al.,
Proc. Natl. Acad. Sci. USA 84:214-218 (1987); Better et al., Science 240:1041-
1043 (1988); and Harlow and Lane, ANTIBODIES: A LABORATORY
MANUAL, Cold Spring Harbor Laboratory (1988)). These references are entirely
incorporated herein by reference.

An anti-idiotypic (anti-Id) antibody is an antibody which recognizes unique 
determinants generally associated with the antigen-binding site of an antibody. An
anti-Id antibody can be prepared by immunizing an animal of the same species and
genetic type (e.g., mouse strain) as the source of the mAb with the mAb to which 
an anti-Id is being prepared. The immunized animal will recognize and respond 
to the idiotypic determinants of the immunizing antibody by producing an antibody 
to these idiotypic determinants (the anti-Id antibody). See, for example, U.S. 
patent No. 4,699,880, which is herein entirely incorporated by reference.

The anti-Id antibody may also be used as an "immunogen" to induce an
immune response in yet another animal, producing a so-called anti-anti-Id
antibody. The anti-anti-Id may be epitopically identical to the original mAb which 
induced the anti-Id. Thus, by using antibodies to the idiotypic determinants of a 
mAb, it is possible to identify other clones expressing antibodies of identical 
specificity. The anti-Id mAbs thus have their own idiotypic epitopes, or 
"idiotopes" structurally similar to the epitope being evaluated, such as GRB 
protein-a. 

The term "antibody" is also meant to include both intact immunoglobulin 
molecules as well as fragments thereof, such as, for example, Fab and F(ab')2, 
which are capable of binding antigen. Fab and F(ab')2 fragments lack the Fc 
fragment of intact antibody, clear more rapidly from the circulation, and may have 
less non-specific tissue binding than an intact antibody (Wahl et al., J. Nucl. Med. 
24:316-325 (1983)). It will be appreciated that Fab and F(ab')2 and other 
fragments of the antibodies useful in the present invention may be used for the 
detection and quantitation of a TIF2 according to the methods disclosed herein for 
intact antibody molecules. Such fragments are typically produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to 
produce F(ab')2 fragments). 

An antibody is said to be "capable of binding" a molecule if it is capable of 
specifically reacting with the molecule to thereby bind the molecule to the 
antibody. The term "epitope" is meant to refer to that portion of any molecule 
capable of being bound by an antibody which can also be recognized by that anti- 
body. Epitopes or "antigenic determinants" usually consist of chemically active 
surface groupings of molecules such as amino acids or sugar side chains and have 
specific three dimensional structural characteristics as well as specific charge 
characteristics. 

An "antigen" is a molecule or a portion of a molecule capable of being 
bound by an antibody which is additionally capable of inducing an animal to 
produce antibody capable of binding to an epitope of that antigen. An antigen 
may have one, or more than one, epitope. The specific reaction referred to above 
is meant to indicate that the antigen will react, in a highly selective manner, with 
its corresponding antibody and not with the multitude of other antibodies which 
may be evoked by other antigens. 

The antibodies, or fragments of antibodies, useful in the present invention 
may be used to quantitatively or qualitatively detect a TIF2 protein, polypeptide, 
or fragment, in a sample or to detect presence of cells which express a TIF2 of the 
present invention. This can be accomplished by immunofluorescence techniques 
employing a fluorescently labeled antibody (see below) coupled with light 
microscopic, flow cytometric, or fluorometric detection. 

The antibodies (or fragments thereof) useful in the present invention may 
be employed histologically, as in immunofluorescence or immunoelectron 
microscopy, for in situ detection of a TIF2 protein, polypeptide, or fragment, of 
the present invention. In situ detection may be accomplished by removing a 
histological specimen from a patient, and providing a labeled antibody of the 
present invention to such a specimen. The antibody (or fragment) is preferably 
provided by applying or by overlaying the labeled antibody (or fragment) to a 
biological sample. Through the use of such a procedure, it is possible to 
determine not only the presence of a TIF2 protein, polypeptide, or fragment, but 
also its distribution on the examined tissue. Using the present invention, those of 
ordinary skill will readily perceive that any of a wide variety of histological 
methods (such as staining procedures) can be modified in order to achieve such 
in situ detection. 

Such assays for a TIF2 protein, polypeptide, or fragment, of the present 
invention typically comprise incubating a biological sample, such as a biological 
fluid, a tissue extract, freshly harvested cells such as lymphocytes or leukocytes, 
or cells which have been incubated in tissue culture, in the presence of a detectably 
labeled antibody capable of identifying a TIF2 protein, polypeptide, or fragment, 
and detecting the antibody by any of a number of techniques well-known in the 
art. 

The biological sample may be treated with a solid phase support or carrier 
such as nitrocellulose, or other solid support or carrier which is capable of 
immobilizing cells, cell particles or soluble proteins. The support or carrier may 
then be washed with suitable buffers followed by treatment with a detectably 
labeled TIF2-specific antibody. The solid phase support or carrier may then be 
washed with the buffer a second time to remove unbound antibody. The amount 
of bound label on said solid support or carrier may then be detected by 
conventional means. 

By "solid phase support", "solid phase carrier", "solid support", "solid 
carrier", "support" or "carrier" is intended any support or carrier capable of 
binding antigen or antibodies. Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, gabbros, and magnetite. The nature of the 
carrier can be either soluble to some extent or insoluble for the purposes of the 
present invention. The support material may have virtually any possible structural 
configuration so long as the coupled molecule is capable of binding to an antigen 
or antibody. Thus, the support or carrier configuration may be spherical, as in a 
bead, or cylindrical, as in the inside surface of a test tube, or the external surface 
of a rod. Alternatively, the surface may be flat such as a sheet, test strip, etc. 
Preferred supports or carriers include polystyrene beads. Those skilled in the art 
will know many other suitable carriers for binding antibody or antigen, or will be 
able to ascertain the same by use of routine experimentation. 

The binding activity of a given lot of anti-TIF2 antibody may be 
determined according to well known methods. Those skilled in the art will be able 
to determine operative and optimal assay conditions for each determination by 
employing routine experimentation. Other such steps as washing, stirring, 
shaking, filtering and the like may be added to the assays as is customary or 
necessary for the particular situation. 

One of the ways in which a TIF2-specific antibody can be detectably 
labeled is by linking the same to an enzyme for use in an enzyme immunoassay 
(EIA). This enzyme, in turn, when later exposed to an appropriate substrate, will 
react with the substrate in such a manner as to produce a chemical moiety which 
can be detected, for example, by spectrophotometric, fluorometric or visual 
means. Enzymes which can be used to detectably label the antibody include, but 
are not limited to, malate dehydrogenase, staphylococcal nuclease, delta-5-steroid 
isomerase, yeast alcohol dehydrogenase, alpha-glycerophosphate dehydrogenase, 
triose phosphate isomerase, horseradish peroxidase, alkaline phosphatase, 
asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. The 
detection can be accomplished by colorimetric methods which employ a 
chromogenic substrate for the enzyme. Detection may also be accomplished by 
visual comparison of the extent of enzymatic reaction of a substrate in comparison 
with similarly prepared standards. 
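
The comparison with similarly prepared standards described above can be made quantitative by reading a sample signal off a standard curve. The following Python sketch is illustrative only: the function name `interpolate_concentration`, the choice of linear interpolation between neighbouring standards, and every concentration and absorbance value are assumptions introduced here, not data or methods taken from this specification.

```python
from bisect import bisect_left


def interpolate_concentration(standards, signal):
    """Estimate a concentration from a signal by linear interpolation.

    `standards` is a list of (concentration, signal) pairs in which the
    signal rises with concentration. All values here are hypothetical.
    """
    pts = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in pts]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal lies outside the range of the standards")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        # The reading coincides with one of the standards.
        return pts[i][0]
    (c0, s0), (c1, s1) = pts[i - 1], pts[i]
    return c0 + (c1 - c0) * (signal - s0) / (s1 - s0)


# Hypothetical standards: (antigen in ng/ml, absorbance reading).
STANDARDS = [(0.0, 0.05), (10.0, 0.25), (20.0, 0.45), (40.0, 0.85)]

estimate = interpolate_concentration(STANDARDS, 0.35)
print(f"estimated antigen concentration: {estimate:.1f} ng/ml")
```

Readings outside the range of the standards raise an error rather than being extrapolated, since a standard curve says nothing reliable beyond its end points.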

Detection may be accomplished using any of a variety of other 
immunoassays. For example, by radioactively labeling the antibodies or antibody 
fragments, it is possible to detect R-PTPase through the use of a 
radioimmunoassay (RIA). A good description of RIA may be found in Laboratory 
Techniques and Biochemistry in Molecular Biology, by Work, T.S. et al., North 
Holland Publishing Company, NY (1978) with particular reference to the chapter 
entitled "An Introduction to Radioimmune Assay and Related Techniques" by 
Chard, T., incorporated by reference herein. The radioactive isotope can be 
detected by such means as the use of a γ counter or a scintillation counter or by 
autoradiography. 

It is also possible to label an anti-TIF2 antibody with a fluorescent 
compound. When the fluorescently labeled antibody is exposed to light of the 
proper wavelength, its presence can then be detected due to fluorescence. 
Among the most commonly used fluorescent labeling compounds are fluorescein 
isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, 
o-phthaldehyde and fluorescamine. 

The antibody can also be detectably labeled using fluorescence emitting 
metals such as 152Eu, or others of the lanthanide series. These metals can be 
attached to the antibody using such metal chelating groups as 
diethylenetriaminepentaacetic acid (DTPA) or ethylenediaminetetraacetic acid 
(EDTA). 

The antibody also can be detectably labeled by coupling it to a 
chemiluminescent compound. The presence of the chemiluminescent-tagged 
antibody is then determined by detecting the presence of luminescence that arises 
during the course of a chemical reaction. Examples of particularly useful 
chemiluminescent labeling compounds are luminol, isoluminol, theromatic 
acridinium ester, imidazole, acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label the antibody 
of the present invention. Bioluminescence is a type of chemiluminescence found 
in biological systems in which a catalytic protein increases the efficiency of the 
chemiluminescent reaction. The presence of a bioluminescent protein is 
determined by detecting the presence of luminescence. Important bioluminescent 
compounds for purposes of labeling are luciferin, luciferase and aequorin. 

An antibody molecule of the present invention may be adapted for 
utilization in an immunometric assay, also known as a "two-site" or "sandwich" 
assay. In a typical immunometric assay, a quantity of unlabeled antibody (or 
fragment of antibody) is bound to a solid support or carrier and a quantity of 
detectably labeled soluble antibody is added to permit detection and/or 
quantitation of the ternary complex formed between solid-phase antibody, antigen, 
and labeled antibody. 

Typical, and preferred, immunometric assays include "forward" assays in 
which the antibody bound to the solid phase is first contacted with the sample 
being tested to extract the antigen from the sample by formation of a binary solid 
phase antibody-antigen complex. After a suitable incubation period, the solid 
support or carrier is washed to remove the residue of the fluid sample, including 
unreacted antigen, if any, and then contacted with the solution containing an 
unknown quantity of labeled antibody (which functions as a "reporter molecule"). 
After a second incubation period to permit the labeled antibody to complex with 
the antigen bound to the solid support or carrier through the unlabeled antibody, 
the solid support or carrier is washed a second time to remove the unreacted 
labeled antibody. 

In another type of "sandwich" assay, which may also be useful with the 
antigens of the present invention, the so-called "simultaneous" and "reverse" 
assays are used. A simultaneous 
assay involves a single incubation step as the antibody bound to the solid support 
or carrier and labeled antibody are both added to the sample being tested at the 
same time. After the incubation is completed, the solid support or carrier is 
washed to remove the residue of fluid sample and uncomplexed labeled antibody. 
The presence of labeled antibody associated with the solid support or carrier is 
then determined as it would be in a conventional "forward" sandwich assay. 

In the "reverse" assay, stepwise addition first of a solution of labeled 
antibody to the fluid sample followed by the addition of unlabeled antibody bound 
to a solid support or carrier after a suitable incubation period is utilized. After a 
second incubation, the solid phase is washed in conventional fashion to free it of 
the residue of the sample being tested and the solution of unreacted labeled 
antibody. The labeled antibody associated with a solid support 
or carrier is then determined as in the "simultaneous" and "forward" assays. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Example 1: TIF2 Cloning and Expression 

In keeping with previous reports (Halachmi, S. et al., Science 264:1455- 
1458 (1994); Cavailles, V. et al., Proc. Natl. Acad. Sci. USA 91:10009-10013 
(1994); Kurokawa, R. et al., Nature 377:451-454 (1995)), we observed agonist- 
dependent binding in vitro of a 160-kDa protein from 35S-labelled whole cell 
extracts (HeLa, Cos-1, P19.6, MCF-7) to the glutathione-S-transferase (GST)- 
tagged LBDs of retinoic acid (RAR) and estrogen (ER) receptors (FIG. 2a). One 
cDNA clone, identified by screening 340,000 clones of a human placenta cDNA 
expression library with an estradiol (E2)-bound 32P-labelled ER(DEF) probe, 
encoded a protein fragment (TIF2.1) that interacted on Far-Western blots with 
three different 32P-labelled NR LBDs (ER, RAR, RXR) in an agonist-dependent 
manner (not shown), and could therefore correspond to the above 160-kDa 
protein. The TIF2 coding sequence (FIG. 3a), preceded by in-frame stop codons 
5' of the initiator AUG, was obtained upon rescreening with a TIF2.1 cDNA 
probe. Human TIF2 cDNA encodes a 159,160 Da protein (1,464 amino acids), 
which includes N-terminal putative nuclear localization signals (NLSs), one Gln- 
and three Ser/Thr-rich regions, and two charged clusters (FIG. 3). Some regions 
of TIF2 show significant sequence similarities with the recently described (Onate, 
S.A. et al, Science 270:1354-1357 (1995)) steroid receptor coactivator SRC-1 
(FIG. 3). TIF2 appears to be widely expressed, since the corresponding transcript 
was found in several human tissues, albeit at a much lower level in kidney (FIG. 2c 
and not shown). 
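
As a rough consistency check on the figures quoted above, the predicted mass can be divided by the number of residues. This short Python sketch uses only the two numbers stated in the text; the variable names are introduced here for illustration, and the comparison with a typical average residue mass of roughly 110 Da is a general rule of thumb rather than a statement from this specification.

```python
# Values quoted in the text above.
MASS_DA = 159_160    # predicted mass of the human TIF2 protein, in Da
N_RESIDUES = 1_464   # number of amino acids encoded by the cDNA

# Average mass contributed by each residue.
mean_residue_mass = MASS_DA / N_RESIDUES
print(f"mean residue mass: {mean_residue_mass:.1f} Da")
```

A value close to the usual per-residue average is what one would expect if the 1,464-residue open reading frame accounts for the 160-kDa species seen on gels.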

Immunodepletion studies strongly support that TIF2 is the 160-kDa 
protein species which interacts in an agonist-dependent manner with NR LBDs 
(see above and Halachmi, S. et al., Science 264:1455-1458 (1994); Cavailles, V. 
et al., Proc. Natl. Acad. Sci. USA 91:10009-10013 (1994); and Kurokawa, R. et 
al., Nature 377:451-454 (1995)). Western blotting with a rabbit antiserum 
(pα-TIF2), raised against bacterially expressed TIF2.1, revealed predominantly a 
160-kDa HeLa cell protein that interacted with agonist-bound GST-ER(DEF) (FIG. 
2b, lanes 1 and 2; see also legend to FIG. 2b). Immunodepletion with a mouse 
monoclonal TIF2 antibody (mα-TIF2) prior to affinity purification resulted in a 
specific decrease of TIF2, but not TIF1 (Le Douarin, B. et al., EMBO J. 14:2020- 
2033 (1995)) amounts, retained on E2-bound GST-ER(DEF) (FIG. 2b, compare 
lanes 2 with 4 and 6 with 8). Importantly, the subsequent Far-Western analysis 
with an E2-bound 32P-GST-ER(DEF) probe revealed the 160-kDa species only in 
control, but not TIF2-immunodepleted extracts (FIG. 2b, compare lanes 10 
and 12). 

Transiently expressed full-length TIF2 was nuclear and mainly associated 
with discrete bodies (FIG. 4a). Since the overexpressed TIF2.1 fragment was 
essentially cytoplasmic (supporting the above assignment of an N-terminal TIF2 
NLS), the interaction of TIF2.1 with NRs could be studied in mammalian cells 
using nuclear cotranslocation assays. In the absence of ligand, TIF2.1 remained 
cytoplasmic and NRs were nuclear (for RARα, ER and PR, see FIGs. 4b, d, g). 
Agonist exposure, however, resulted in all three cases in nuclear colocalization of 
TIF2.1 and NR, indicating NR-TIF2 interaction in vivo (FIGs. 4c, e, h). Agonist- 
dependent interaction of TIF2.1 with NRs was observed for all other receptors 
analyzed (RXR, TR, VDR, GR and AR; not shown). Interestingly, no interaction 
was detected between ER and TIF2.1 in the presence of the ER AF-2 antagonist 
hydroxytamoxifen (OHT) (FIG. 4f), and the PR AF-2 antagonist RU 486 reversed 
the R5020-induced PR-TIF2.1 interaction (FIG. 4i). 

In agreement with the Far-Western blot experiments, NRs and TIF2 
directly interacted, as purified TIF2.1 protein bound in vitro in the presence of an 
agonist to GST-ER(DEF), GST-RARα(DEF), GST-RXRα(DE) and 
GST-TR(DE) (FIG. 4k, l, m, lanes 3 and 4; FIG. 4n, lanes 5 and 6). As expected, 
TIF2 binding to GST-RXRα(DE) occurred with 9cis-RA (9C-RA) but not 
all-trans-RA (T-RA) (FIG. 4n, lanes 1 and 2), and OHT prevented E2-dependent 
binding of TIF2 to GST-ER(DEF) (FIG. 4n, lanes 7-9). The integrity of the 
conserved core of the ER, RARα and RXRα AF-2 activating domains (AF-2 AD), 
which was shown to be critical for AF-2 activity (Le Douarin, B. et al., EMBO J. 
14:2020-2033 (1995); Danielian, P.S. et al., EMBO J. 11:1025-1033 (1992); 
Durand, B. et al., EMBO J. 13:5370-5382 (1994); and Gronemeyer, H. and 
Laudet, V., Protein Profile 2:1173-1308 (1995), and references therein), was 
required for TIF2 interaction in vitro. Most AF-2 AD core mutants which have lost 
AF-2 activity (ER, FIG. 4k, lanes 5-8; RARα, FIG. 4l, lanes 5-10; RXRα, FIG. 4m, 
lanes 5-8) did not detectably, or only weakly, associate with TIF2, whereas the 
GST-LBD fusion of the RXRα mutant E461Q, whose AF-2 is only partially 
impaired (Le Douarin, B. et al., EMBO J. 14:2020-2033 (1995)), still exhibited 
a significant RA-dependent interaction with TIF2.1 in vitro (FIG. 4m, lanes 9 and 
10). No significant interaction of TIF2.1 was observed with either GST-VP16 
(acidic activation domain), GST-TBP, GST-TFIIB, or a series of GST-TAFs 
(hTAFII18, hTAFII20, hTAFII28 and hTAFII55; see Jacq, X. et al., Cell 79:107-117 
(1994); Mengus, G. et al., EMBO J. 14:1520-1531 (1995)) (not shown). 

Conceptually, a TIF capable of mediating the transcriptional activity of a 
cognate AF to the transcription machinery could itself be an activator when fused 
to a heterologous DNA-binding domain. Interestingly, in transiently transfected 
HeLa cells, TIF2.1 fused to the GAL4 DNA-binding domain strongly 
transactivated a GAL4 reporter (FIG. 5a). Thus, TIF2 may correspond to one of 
the hypothetical limiting factor(s) previously proposed to be involved in NR 
transcriptional interference/squelching (Meyer, M.-E. et al., Cell 57:433-442 
(1989); Bocquel, M.-T. et al., Nucl. Acids Res. 17:2581-2595 (1989); Tasset, D. 
et al., Cell 62:1177-1187 (1990)). Supporting this possibility, "anti-squelching" 
experiments showed that expression of TIF2.1 in ER-transfected cells could, at 
least partially, reverse the transcriptional autointerference (Bocquel, M.-T. et al., 
Nucl. Acids Res. 17:2581-2595 (1989)) generated by expressing increased 
amounts of ER (FIG. 5b; note the marked shift of the bell-shaped activation curve 
to higher ER concentrations in the presence of TIF2.1). At high ER expression 
levels, the TIF2.1-stimulated transactivation decreased, possibly due to squelching 
of other putative mediators (Jacq, X. et al., Cell 79:107-117 (1994); Lee, J.W. et 
al., Nature 374:91-94 (1995); Le Douarin, B. et al., EMBO J. 14:2020-2033 
(1995); vom Baur, E. et al., EMBO J. 15:110-124 (1996); Lee, J.W. et al., 
Endocrinology 9:243-254 (1995); Cavailles, V. et al., EMBO J. 14:3741-3751 
(1995); Onate, S.A. et al., Science 270:1354-1357 (1995)) and/or transcriptional 
factors. 



As expected, coexpression of TIF2.1 with antagonist-bound NR did not 
lead to any stimulation of the transactivation brought about by AF-1 in the 
presence of pure AF-2 antagonists (Berry, M. et al., EMBO J. 9:2811-2818 
(1990); Meyer, M.E. et al., EMBO J. 9:3923-3932 (1990)), further supporting 
that TIF2 is AF-2-specific (FIGs. 5b and 5e for ER, OHT; FIG. 5d for PR, 
RU486). TIF2 expression also increased AF-2/agonist-mediated transactivation 
by the androgen (AR) and progesterone (PR) receptors, but not transactivation 
by GAL-VP16 and GAL-AP2 (FIG. 5c). Under similar conditions, transactivation 
by GAL-RAR, GAL-RXR, GAL-VDR, GAL-TR and GAL-GR was unaffected 
by TIF2 (FIG. 5c, and not shown), suggesting that for these NRs either TIF2 is 
not critically involved in mediating their AF-2 activities or endogenous TIF2 
amounts are sufficient to optimally support transactivation, for instance, because 
TIF2 has a higher affinity for these receptors. TIF2-stimulation is to some extent 
affected by the promoter environment of the responsive gene, as the TIF2 effect 
on PR/R5020-induced transactivation was greater for a complex (MMTV) than for 
a minimal (GRE-TATA) promoter, although the latter was also reproducibly 
stimulated (FIG. 5d). As expected from the distinct levels of TIF2 transcripts in 
different tissues (FIG. 2c), the effect of TIF2 was cell type-dependent, since TIF2 
had a much stronger effect on PR- and ER-induced transactivations in Cos-1 than 
in HeLa cells (FIGs. 5d, e). 

Squelching (Meyer, M.-E. et al., Cell 57:433-442 (1989); Bocquel, M.-T. 
et al., Nucl. Acids Res. 17:2581-2595 (1989); Tasset, D. et al., Cell 62:1177- 
1187 (1990)) and structural studies (Bourguet, W. et al., Nature 375:377-382 
(1995); Renaud, J.-P. et al., Nature 378:681-689 (1995); Wagner, R.L. et al., 
Nature 378:690-697 (1995); Wurtz, J.-M. et al., Nature Struct. Biol. 3:87-94 
(1996)) have supported a model in which binding of the ligand to the LBD of NRs 
results in conformational changes generating the surface(s) required for interaction 
with transcriptional intermediary factors (TIFs/mediators) which transduce the 
AF-2 activity to the transcription machinery. Conceptually, such mediators should 
exhibit the following properties: (i) they should bind to agonist-, but not 
antagonist-bound NR LBDs, (ii) their binding should be prevented by mutations 
abolishing AF-2 activity, (iii) they should collectively exhibit a transactivation 
function(s), (iv) their expression should relieve AF-2 autosquelching, and (v) their 
overexpression should stimulate the activity of AF-2, whenever they are present 
in limiting amounts. The present study is the first report of a bona fide mediator 
of NR AF-2s which exhibits all these properties. 

Example 2: Production of TIF-2 Antibodies 

The following TIF2 antibodies were made using known techniques, unless 
otherwise specified below. See, e.g., Ausubel et al., eds., CURRENT 
PROTOCOLS IN MOLECULAR BIOLOGY, Greene Publishing Assoc. and Wiley 
Interscience, N.Y., (1987-1996); Harlow and Lane, ANTIBODIES: A 
LABORATORY MANUAL, Cold Spring Harbor Laboratory (1988); and Colligan 
et al., eds., Current Protocols in Immunology, Greene Publishing Assoc. and 
Wiley Interscience, N.Y., (1992-1996), the contents of which references are 
incorporated entirely herein by reference. 

Polyclonal Antibodies. The TIF2.1 coding sequence (amino acids 624- 
1287) was cloned into pET15b and the resulting plasmid transformed into the 
E. coli strain BL21 (DE3). After overexpression of the (His)6-TIF2.1 protein the 
bacteria were lysed and the protein purified from the crude extract via affinity 
chromatography on a HiTrap chelating column (Pharmacia) as described (see 
Bourguet et al., Prot. Expr. Purif. 5:604-608 (1995) for technical details). 
Aliquots of the purified protein (50 µg) in emulsion (Freund's Adjuvant) were 
injected (only once) into a New Zealand rabbit using a multisite intradermal 
injection protocol. Antisera obtained from serial bleeds revealed a single band of 
about 160 kDa on Western blots of extracts from Cos-1 cells transformed with a 
full length TIF2 expression vector. 

Monoclonal Antibodies. A 20-mer amino-acid peptide of TIF2 (with an 
added C-terminal cysteine) was selected on the basis of its potential immunogenic 
characteristics in terms of hydrophilicity, flexibility, surface probability and the 
'antigenic index' according to software programmes Plot and Peptidestructure 
from the GCG package. 

The chosen peptide corresponds to the N terminal fragment encoded by 
the TIF2 partial cDNA initially isolated (which encodes amino acids 624 to 1287) 
and corresponds to amino acids 624 to 643 of the total protein (SEQ ID NO:2): 

ERADGQSRLHDSKGQTKLLQ(C) (SEQ ID NO:4) 
624 643 
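
The hydrophilicity criterion can be illustrated with a small calculation on the peptide shown above. The Python sketch below is a stand-in under stated assumptions, not a reimplementation of the GCG programs: it uses the published Kyte-Doolittle hydropathy scale and a plain average over the whole peptide, and the names `KYTE_DOOLITTLE` and `mean_hydropathy` are introduced here for illustration only. On this scale a negative mean indicates a predominantly hydrophilic peptide.

```python
# Kyte-Doolittle hydropathy values (higher means more hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Peptide and numbering as given in the text (added cysteine omitted).
PEPTIDE = "ERADGQSRLHDSKGQTKLLQ"
FIRST, LAST = 624, 643


def mean_hydropathy(seq):
    """Average Kyte-Doolittle hydropathy over all residues of seq."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# The stated positions should span exactly the peptide length.
assert len(PEPTIDE) == LAST - FIRST + 1

score = mean_hydropathy(PEPTIDE)
print(f"{len(PEPTIDE)} residues, mean hydropathy {score:.1f}")
```

The antigenic index mentioned in the text also weighs flexibility and surface probability, which this single-scale average does not attempt to model.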

The peptide was coupled to ovalbumin via the additional cysteine using the 
MBS heterobifunctional crosslinker. Injections were performed in BALB/c mice 
intraperitoneally and intravenously. 

Spleen cells from the immunized mice were fused to the Sp2/0-Ag14 
myeloma. Growing hybridomas were first screened by ELISA using the 
recombinant TIF2.1 protein and the free peptide. Positive cultures were then 
tested by immunocytofluorescence on Cos cells transfected with TIF2.1 (in the 
pSG5 vector) as well as by western blot using the transfected Cos cell extracts and 
HeLa nuclear extract. The positive cultures were also tested for their ability to 
immunoprecipitate the TIF2.1 protein from transfected Cos cells. 



Cultures were cloned twice on soft agar. Five hybridomas have been 
established, 4 secreting IgG1, κ and 1 secreting IgG2a, κ antibodies, as shown in 
the Table: 

Hybridoma            ELISA   Immunofluoresc.   Western blot   Immunoprecip.
                             (Cos)             (Cos)          (Cos)

1Ti-1B6 (IgG1, κ)    +       +                 +              +
1TMC9 (IgG1, κ)      +       +                 +
1TMD8 (IgG1, κ)                                               +
1TMD12 (IgG1, κ)     +                         ++             +
1TMG3 (IgG2a, κ)             +                 ++             +

Example 3: Identification of TIF2 NID 

All recombinant DNA work was performed according to standard 
procedures (Ausubel, F.M. et al., Current Protocols in Molecular Biology, John 
Wiley & Sons, New York (1993)). GST-fusions of nuclear receptors were 
expressed from the following previously described plasmids: GST (pGEX2T: 
Pharmacia), GST-ER (pGEX2T-hERα(DEF), also called pGEX2THE14G, amino 
acids 282-595), GST-RXR (pGEX2T-mRXRα(DE), amino acids 205-467), and 
GST-RAR (pGEX2T-hRARα(DEF), amino acids 153-462) (all LeDouarin, B. et 
al., EMBO J. 14:2020-2033 (1995a)). 

To further delineate the TIF2 NR interaction domain (NID), we studied 
the interaction between a series of TIF2 deletion mutants and the ER or RARα 
LBDs, using GST-fusion protein-based in vitro assays. In both cases, an NID was 
mapped to the central region of TIF2 (amino acids 624-869 in mutant TIF2.5; see 
FIGS. 7a and b). No additional NID could be identified in the N- or C-termini of 
TIF2 (FIGS. 7a and b; mutants TIF2.0, TIF2.2 and TIF2.7). Note that, in 
contrast, SRC-1, a paralogue of TIF2, apparently harbours two distinct 
non-contiguous NIDs located in the central and C-terminal regions (Onate, S.A. et 
al., Science 270:1354-1357 (1995); Yao, T.P. et al., Proc. Natl. Acad. Sci. USA 
93:10626-10631 (1996); Zhu, Y. et al., Gene Expression 6:185-195 (1996)). 

To further narrow down the TIF2 NID, TIF2.5 was C-terminally truncated 
to Pro775, yielding TIF2.34 which also interacted with ER and RARα LBDs in 
a ligand-dependent manner (FIGS. 8a and c). Upon further truncation to Ser697, 
the resulting mutant still interacted with both ER and RARα LBDs but, 
surprisingly, a ligand-dependent interaction was also found with TIF2.36 (FIGS. 
8a and c), thus indicating that the TIF2 NID is composed of at least two 
autonomous NR-interacting modules. 

An alignment of the TIF2 NID amino acid sequence present in TIF2.34 
with the corresponding region of SRC-1 (Onate, S.A. et al., Science 270:1354- 
1357 (1995)) revealed three highly conserved regions (FIG. 8a). Interestingly, all 
three contain the motif LxxLL (SEQ ID NO: 12) (FIG. 8b), originally identified in 
the so-called "nuclear receptor box" (NR box) of TIF1α as the LxxLLL (SEQ ID 
NO: 13) motif, which was also present in RIP140 (Cavailles, V. et al., EMBO J. 
14:3741-3751 (1995)) and TRIP3 (Lee, J.W. et al., (1995)) (see LeDouarin, B. et 
al., EMBO J. 15:6701-6715 (1996) and FIG. 8b). Importantly, 10-amino-acid 
sequences comprising the TIF1α or RIP140 NR boxes were sufficient for functional 
interaction with RXR in a ligand- and AF-2 AD-integrity-dependent manner, and 
mutation of the leucines at positions 4 and 5 (LL→AA) of the TIF1α LxxLLL (SEQ 
ID NO: 13) motif abrogated TIF1α-RXR interaction (LeDouarin, B. et al., EMBO 
J. 15:6701-6715 (1996)). The functionality of the RIP140 NR box was recently 
confirmed (Heery, D.M. et al., Nature 387:733-736 (1997)). 
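
The LxxLL motif and the LL→AA substitution at motif positions 4 and 5 described above lend themselves to a short illustration. The Python sketch below is hypothetical throughout: the function names `find_nr_boxes` and `mutate_ll_to_aa` are introduced here, and the toy sequence is made up for the example and is not taken from TIF2, TIF1α or any other protein named in this specification.

```python
import re

# A lookahead is used so that overlapping matches would also be reported.
NR_BOX = re.compile(r"(?=(L..LL))")


def find_nr_boxes(seq):
    """Return the 0-based start position of every LxxLL match in seq."""
    return [m.start() for m in NR_BOX.finditer(seq)]


def mutate_ll_to_aa(seq, start):
    """Replace the leucines at motif positions 4 and 5 with alanines."""
    if not re.fullmatch(r"L..LL", seq[start:start + 5]):
        raise ValueError("no LxxLL motif at this position")
    return seq[:start + 3] + "AA" + seq[start + 5:]


# Made-up toy sequence containing two LxxLL motifs (NOT from TIF2).
TOY = "GSKLVQLLTTAEQHLRSLLDKP"

boxes = find_nr_boxes(TOY)
mutant = mutate_ll_to_aa(TOY, boxes[0])
print(boxes, mutant, find_nr_boxes(mutant))
```

After the substitution the first motif no longer matches the pattern while the second is untouched, which mirrors the idea of inactivating one NR box at a time.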

To investigate the functional significance of these three motifs in the TIF2 
NID, the above LL→AA mutation was introduced in TIF2.1, either into each 
motif alone or in all possible combinations of the three motifs (TIF2.1m1 to m123 
in FIG. 8a; numbers following "m" refer to the mutated motifs). Mutation of all 
three NR boxes was required to abrogate both the binding to ER, RARα and 
RXRα (FIG. 8d, quantitation in FIG. 8e, white bars) and TIF2-dependent 
stimulation of ligand-induced transactivation by ER and RXRα AF-2s (FIG. 8e, 
black bars, and data not shown; in the case of RARα AF-2 the stimulation was 
weak and was not quantified). TIF2.1 constructs in which two NR box motifs 
were mutated still exhibited ER binding and stimulation of AF-2 activity, in 
particular when the NR box motif II was intact, suggesting that the three NR 
box-containing modules are, at least in part, functionally redundant (FIGS. 8d and 
e; TIF2.1m12, m13, and m23). This redundancy was obvious when only one NR 
box motif was mutated; in contrast to TIF1α, which contains only one NR box 
(LeDouarin, B. et al., EMBO J. 15:6701-6715 (1996)), mutation of a single TIF2 
NR box did not abrogate ER binding and stimulation of ER AF-2 activity. All 
three mutants (TIF2.1m1 to m3) bound to ER and stimulated estradiol-dependent 
transactivation by the ER with similar efficiency as TIF2.1 itself (FIGS. 8d and 
e). In the case of RARα and RXRα the mutations had, in general, a more 
deleterious effect on receptor LBD binding and stimulation of AF-2 activity than 
in the case of ER (FIGS. 8d and e). This may possibly reflect a weaker 
interaction between TIF2 and either RARα or RXRα than with ER. However, in 
spite of exhibiting in general a lower activity, the patterns of NR binding and 
stimulation of AF-2 activity of the NR box mutants were similar for ER and 
RARα and RXRα, as mutation of motif II was always more detrimental in double 
mutants than mutation of motifs I and III (see FIG. 8e). Importantly, for both ER 
and RXRα there was a good correlation between the effect of any of the various 
mutations on TIF2.1-receptor binding in vitro and TIF2.1-mediated stimulation 
of AF-2 activity (FIG. 8e), supporting a mechanism whereby the stimulation of 
AF-2 activity by TIF2 involves TIF2-NR interaction through NR holo-LBD-TIF2 
NR box interface(s). 
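
The set of mutants obtained by mutating each motif alone or in all possible combinations can be enumerated mechanically. The following Python sketch is only an illustration of that combinatorial design: the function name `mutant_labels` is introduced here, and the labels simply follow the naming rule stated above, in which the digits after "m" list the mutated motifs.

```python
from itertools import combinations

MOTIFS = (1, 2, 3)  # NR box motifs I, II and III


def mutant_labels(motifs=MOTIFS):
    """List a label for every non-empty subset of mutated motifs."""
    labels = []
    for k in range(1, len(motifs) + 1):
        for combo in combinations(motifs, k):
            labels.append("m" + "".join(str(i) for i in combo))
    return labels


labels = mutant_labels()
print(labels)
```

Three motifs give 2^3 - 1 = 7 mutants: three single, three double and one triple mutant, which matches the single (m1 to m3), double (m12, m13, m23) and triple (m123) constructs discussed in the text.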

Our present structure-function analysis reveals that TIF2 contains a single
nuclear receptor-interaction domain (NID). This is in contrast to the other TIF2
family member, SRC-1, which was reported to contain two NIDs (Onate, S.A. et
al., Science 270:1354-1357 (1995); Yao, T.P. et al., Proc. Natl. Acad. Sci. USA
93:10626-10631 (1996); Zhu, Y. et al., Gene Expression 5:185-195 (1996)).
Note, however, that one of the two SRC-1 NIDs is most probably homologous to
the TIF2 NID characterized here (see FIG. 8a). The TIF2 NID is composed of
three modules, and each can bind independently in a ligand-dependent manner to
the NRs tested in this study. Interestingly, these modules contain the NR box
motif LxxLL (SEQ ID NO: 12), which was originally recorded (as the motif
LxxLLL (SEQ ID NO: 13), FIG. 8b) within a 10 amino acid NR binding module
of TIF1α which was found to be conserved in the NID of RIP140 and also present
in TRIP3 (LeDouarin, B. et al., EMBO J. 15:6701-6715 (1996) and refs therein).
Moreover, the TIF1α and RIP140 modules were shown to functionally interact
with NRs (LeDouarin, B. et al., EMBO J. 15:6701-6715 (1996)). The implication
of NR box motifs in NR-coactivator binding and their presence in a number of
different coactivators was pointed out in recent reports (Heery, D.M. et al.,
Nature 387:733-736 (1997); Torchia, J. et al., Nature 387:677-684 (1997);
LeDouarin, B. et al., EMBO J. 15:6701-6715 (1996)). Note that all three TIF2
NR box motifs described here are conserved in the recently discovered TIF2
paralogue p/CIP (Torchia, J. et al., Nature 387:677-684 (1997)). In contrast to
TIF1α, for which mutation to alanine of the leucines at positions 4 and 5 of its
single NR box motif (LL→AA) abrogates NR binding, mutation of all three motifs
is required in the TIF2 NID to abrogate NR binding, indicating that each of these
motifs can contribute to a TIF2 surface that interacts with a cognate surface of NR
holo-LBDs. That the NR boxes of TIF2 exhibit functional redundancy is
supported by the observation that the LL→AA mutation in any of the three TIF2
NID motifs either did not reduce (in the case of the ER) or only weakly reduced
(in the case of RARα and RXRα) the efficiency of NR interaction. Moreover, in the
TIF2.1 environment, any single intact NR box motif on its own (i.e., when the two
other motifs were mutated) was sufficient for interaction with the holo-ER LBD,
although only motif II on its own could bring about a nearly wild-type NR binding
efficiency. In contrast, for RARα and RXRα interaction, mutants with single
intact NR boxes were 5 (box II) to 20 (box I or III) times less efficient than the
wild-type TIF2 containing the three NR boxes. Crystallography studies will be
necessary to distinguish between two possible models, in which the three NR box
motifs (i) contribute to the formation of a tripartite NID surface that specifically
recognizes a cognate holo-NR LBD surface, or (ii) belong to independent
surfaces which each can interact, albeit with different efficiencies, with the same
holo-NR surface. Note, however, that the second model could allow TIF2 to
interact cooperatively with both partners of NR homo- or heterodimers, thus
rendering transactivation by NRs sensitive to small variations in TIF2 levels. Note
also that, for both ER and RXRα, the effects of NR box mutations on NR binding
and stimulation of AF-2 activity were correlated, further supporting the conclusion
that the transcriptional effect of TIF2 involves the formation of a NR box-NR
LBD interaction interface.
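
The motif analysis described above can be illustrated with a short Python sketch, which scans a protein sequence for the LxxLL NR box motif and applies the LL→AA substitution at motif positions 4 and 5. This is only a minimal sketch: the peptide below is a made-up toy sequence (it is not taken from the TIF2 sequence), and the function names are hypothetical.

```python
import re

# LxxLL, wrapped in a lookahead so that overlapping matches are also reported.
NR_BOX = re.compile(r"(?=(L..LL))")

def find_nr_boxes(protein):
    """Return the 1-based start position of every LxxLL match."""
    return [m.start() + 1 for m in NR_BOX.finditer(protein)]

def mutate_ll_to_aa(protein, box_start):
    """Replace the leucines at motif positions 4 and 5 with alanines."""
    i = box_start - 1  # 0-based index of motif position 1
    if protein[i] != "L" or protein[i + 3:i + 5] != "LL":
        raise ValueError("no LxxLL motif at this position")
    return protein[:i + 3] + "AA" + protein[i + 5:]

# Hypothetical toy peptide carrying two NR box motifs.
toy = "GSKLHRLLQDSSTEKILHRLLQEG"
boxes = find_nr_boxes(toy)
single_mutant = mutate_ll_to_aa(toy, boxes[0])

print(boxes)                         # motif start positions in the toy peptide
print(find_nr_boxes(single_mutant))  # motifs remaining after one LL->AA mutation
```

In this toy case, mutating one motif leaves the other intact, which mirrors the single-box mutants discussed above; it says nothing about binding efficiency, which has to be measured.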

The motif LxxLL (SEQ ID NO: 12) has been found in a number of other
NR coactivators (see above), thus suggesting some similarity in the mode of
NR-coactivator interactions. However, this does not exclude NR-specific modulation
of these interactions, as the sequences surrounding the NR boxes are highly variable.
In this respect, note also that TIF2 NR boxes II and III are predicted to form
α-helices, while NR box I is predicted to fold into a β-sheet structure (structure
predictions according to SOPMA; Geourjon and Deleage, 1994).

Example 4: Identification of the AD1 and AD2 activation domains of TIF2

Transient transfection assays with a GAL4 reporter plasmid and chimeras
containing various TIF2 fragments linked to the GAL4 DNA binding domain
(DBD) demonstrated the presence of two autonomous transcriptional activation
domains in the C-terminal 460 amino acids of TIF2, termed AD1 and AD2
(delineated by mutants TIF2.8, TIF2.12 and TIF2.2 in FIGS. 7a and c). Transient
transfections of HeLa and Cos-1 cells were performed as described (Gronemeyer
et al., (1987); Bocquel, M.T. et al., Nucl. Acids Res. 17:2581-2595 (1989)).

The N-terminal AD1 (amino acids 1010-1131), which is present in TIF2.1,
showed a stronger activity than the C-terminal AD2 (amino acids 1288-1464)
(compare TIF2.8 and TIF2.12 with TIF2.2 in FIGS. 7a and c). Note, however,
that the weaker activity of AD2 (relative to AD1) could be due to a lower
expression level of the GAL-TIF2.2 fusion protein (compare with GAL-TIF2.8
and GAL-TIF2.12 in FIG. 7c). Both TIF2 activation functions were active in
Cos-1 and HeLa cells (FIG. 7c). Notably, the minimal AD1 (TIF2.8) and AD2
(TIF2.2) constructs exhibited some cell-specific activities, as GAL-TIF2.8 was
more active in HeLa than in Cos-1 cells, whereas the opposite was observed for
GAL-TIF2.2 (FIG. 7c). Interestingly, the glutamine-rich region of TIF2 could
not activate transcription on its own when fused to the GAL4 DBD (FIGS.
7a and c; mutant TIF2.6), nor was it required for transcriptional activation by
AD1 or AD2. No activation function could be detected in the N-terminal part of
TIF2 (see FIGS. 7a and c; mutant TIF2.0).
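
The residue numbering quoted for AD1 and AD2 can be cross-checked against the coding region annotated for SEQ ID NO:1 (CDS 163..4554 in the sequence listing). The following Python sketch does that arithmetic. It assumes that the protein ends at the last residue quoted for AD2 and that every codon in the annotated range encodes one residue; neither assumption is stated in the text, and the variable names are hypothetical.

```python
# Residue ranges quoted in the text (1-based, inclusive).
AD1 = (1010, 1131)
AD2 = (1288, 1464)

# CDS location given for SEQ ID NO:1 (nucleotides, 1-based, inclusive).
CDS_START, CDS_END = 163, 4554

def span_length(start, end):
    """Number of positions in an inclusive range."""
    return end - start + 1

cds_nt = span_length(CDS_START, CDS_END)
codons, remainder = divmod(cds_nt, 3)

# Assumption: the protein ends at the last residue of AD2.
protein_len = AD2[1]
c_term_460_start = protein_len - 460 + 1

print("CDS nucleotides:", cds_nt, "codons:", codons, "remainder:", remainder)
print("AD1 length:", span_length(*AD1), "AD2 length:", span_length(*AD2))
print("C-terminal 460 residues start at:", c_term_460_start)
```

Under these assumptions the codon count equals the last residue number quoted for AD2, and both activation domains fall within the C-terminal 460 residues.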






We conclude from these data that the nuclear receptor interacting domain 
(NID) and the two transcriptional activation functions of TIF2 correspond to 
distinct modular domains, since TIF2.5 can bind to NRs, but cannot activate 
transcription, whereas TIF2.2 and TIF2.8 cannot bind NRs but are able to activate 
transcription (FIG. 7a). 



Example 5: Characterization of the TIF2 AD1 and AD2 activation domains

Recently CBP and p300, originally identified as coactivators of the
transcription factor CREB, were shown to act as general integrators of multiple
signaling pathways, including activation via agonist-bound RARα and TR (for
reviews and refs see Eckner, R., Biol. Chem. 377:685-688 (1996); Janknecht &
Hunter (1996); Glass, C.K. et al., Current Opin. Cell Biol. 9:222-232 (1997);
Shikama, N. et al., Trends in Cell Biol. 7:230-236 (1997)). Furthermore, it was
reported that SRC-1, which belongs to the same gene family as TIF2, interacts
with CBP and p300 (Kamei, Y. et al., Cell 85:403-414 (1996); Yao, T.P. et al.,
Proc. Natl. Acad. Sci. USA 93:10626-10631 (1996); Hanstein, B. et al., Proc.
Natl. Acad. Sci. USA 93:11540-11545 (1996)). Using GST-fusion protein-based
interaction and animal cell-based two hybrid assays, we therefore analyzed
whether TIF2 could also interact with CBP. In the two hybrid system only the
central TIF2.1 fragment, but not the N-terminal TIF2.0 or the C-terminal TIF2.2
fragments (FIG. 7a), scored positive for interaction with GAL-CBP (containing
amino acids 1872-2165 of CBP, which encompass the SRC-1 interacting domain
of CBP; FIG. 7e). A GST-CBP fusion protein was expressed in E. coli and used
for pull-down assays with in vitro translated TIF2 polypeptides (FIG. 7d). TIF2
did interact with CBP and, interestingly, the CBP-interacting domain (CID)
apparently overlapped the AD1 activation domain of TIF2 (compare FIGS. 7a,
c and d; mutants TIF2.8 and TIF2.12). The interaction of TIF2 with CBP was
direct, as a purified E. coli-expressed TIF2.1 protein also interacted with
GST-CBP (data not shown). Only this region of TIF2 interacted significantly with the
GST-CBP fusion protein, thus suggesting that the TIF2 AD1 activity may
originate from the recruitment of CBP. Further N-terminal regions (FIGS. 7a and
d; mutants TIF2.10 and TIF2.4) or the C-terminal AD2 activation domain (FIGS.
7a and d; mutant TIF2.2) did not show any binding to GST-CBP. Note that
TIF2.2 also did not interact with full-length CBP (data not shown), suggesting that
the activity of TIF2 AD2 is mediated by (a) factor(s) distinct from CBP.

To investigate whether the CID of TIF2 could be separated from the AD1
activation domain, the ability of a series of GAL-TIF2 truncation mutants to
activate a GAL4-reporter was compared with their ability to interact with the
GST-CBP protein in vitro (FIG. 9). TIF2.13 (which encompasses Pro1011 to
Ser1122) exhibited a potent transcriptional activity, comparable to that of larger
TIF2 fragments (compare FIGS. 7 and 9). Removal of 26 C-terminal (TIF2.15)
or 20 N-terminal (TIF2.18) amino acid residues reduced transcriptional activity
only weakly (FIGS. 9a and b). Note that TIF2.13 also interacted with CBP in
vivo, as shown by using a two hybrid assay in transfected mammalian cells (FIG.
10b).

While the internal deletion of residues Asp1061 to Ala1070 (TIF2.19) had only
a minor effect on the ability of TIF2.13 to transactivate, deletion of the Glu1071 to
Leu1080 segment (mutant TIF2.20) significantly reduced TIF2 AD1 transcriptional
activity. Notably, these residues belong to a sequence predicted to fold into an
amphipathic α-helical structure which is highly conserved between TIF2 and
SRC-1 (FIG. 9a). The involvement of this region in transactivation was confirmed by
analysis of mutants TIF2.21 to TIF2.32 (FIGS. 9a and b). All constructs
containing the TIF2 wild-type sequence from Asp1075 to Leu1087 stimulated
transcription, whereas even a deletion of only some of these residues significantly
reduced transcriptional activation. However, on its own this α-helical peptide
transactivated very poorly, and had to be embedded in additional upstream
and/or downstream TIF2 sequences to generate significant transcriptional activity
(FIGS. 9a and b; compare mutants TIF2.13, TIF2.21 and TIF2.32). Importantly,
in all cases AD1 activity coincided with CBP interaction, since transcriptionally
inactive constructs did not interact with CBP (TIF2.24, TIF2.27 and TIF2.29 in
FIGS. 9a-c), while transcriptionally active mutants also bound CBP. Moreover,
the strength of the in vitro interaction with GST-CBP apparently correlated with
transactivation efficiency (FIGS. 9a-c; e.g., compare TIF2.21 and TIF2.31).
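
The deletion mapping above amounts to asking whether each construct still contains the intact core segment. The Python sketch below works through that reasoning with inclusive residue intervals. Several values are assumptions rather than statements from the text: TIF2.13 is taken as residues 1011-1122 and the core as 1075-1087 (both read from partly garbled coordinates), and the spans of TIF2.15 and TIF2.18 are derived here from "removal of 26 C-terminal or 20 N-terminal residues" of TIF2.13.

```python
# Residue coordinates, 1-based and inclusive (see the assumptions stated above).
TIF2_13 = (1011, 1122)
CORE = (1075, 1087)

def overlaps(a, b):
    """True if two inclusive intervals share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]

def contains(outer, inner):
    """True if the inner interval lies entirely within the outer one."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]

# construct name -> (fragment span, internal deletion or None)
CONSTRUCTS = {
    "TIF2.13": (TIF2_13, None),
    "TIF2.15": ((TIF2_13[0], TIF2_13[1] - 26), None),  # 26 C-terminal residues removed
    "TIF2.18": ((TIF2_13[0] + 20, TIF2_13[1]), None),  # 20 N-terminal residues removed
    "TIF2.19": (TIF2_13, (1061, 1070)),
    "TIF2.20": (TIF2_13, (1071, 1080)),
}

def core_intact(span, deletion):
    """The core is intact if the fragment spans it and no deletion touches it."""
    if not contains(span, CORE):
        return False
    return deletion is None or not overlaps(deletion, CORE)

for name, (span, deletion) in CONSTRUCTS.items():
    status = "core intact" if core_intact(span, deletion) else "core disrupted"
    print(name, span, deletion, status)
```

With these coordinates, only the TIF2.20 deletion reaches into the core segment, which is consistent with it being the one construct in this group reported to lose most of its activity.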

To investigate whether the integrity of the putative amphipathic α-helical
region is required for both AD1 transcriptional activity and interaction with CBP,
we introduced point mutations into TIF2.13; the three conserved hydrophobic Leu
or the hydrophilic Asp-Gln residues were converted to alanines [FIG. 9a;
TIF2.13(LLL) and TIF2.13(DQ)]. Interestingly, mutation of the three leucine
residues almost completely abolished AD1 activity, whereas mutation of the
Asp-Gln sequence had very little, if any, effect (FIG. 10a). Again, AD1 activity and
interaction with CBP in vitro (FIG. 10c), as well as in vivo (FIG. 10b), were
strictly correlated, since GAL-TIF2.13(DQ), which transactivated as efficiently as
wild type GAL-TIF2.13, interacted strongly with CBP, whereas the
transcriptionally inactive GAL-TIF2.13(LLL) interacted very weakly with CBP.
Together these results indicate that (i) CBP mediates the AD1 activity of TIF2,
(ii) a putative amphipathic α-helix motif located within the AD1 domain is
critically involved in, but not sufficient for, efficient CBP binding/transactivation
and (iii) the amphiphilicity of this motif is not required for AD1 activity and CBP
binding.

The two TIF2 activation functions AD1 and AD2 apparently operate
through different transcriptional activation cascades. While the TIF2 AD1
activation domain could not be separated by mutational analysis from the TIF2
domain which interacts in vitro and in vivo with a region of the CBP surface
which also mediates SRC-1 binding (Kamei, Y. et al., Cell 85:403-414 (1996)),
neither this region, nor full length CBP, interacted with TIF2 AD2. That the two
TIF2 activation functions may operate through distinct pathways is also suggested
by the differential cell specificity of the minimal fragments exhibiting AD1 activity
(e.g., TIF2.8, TIF2.7, TIF2.9, TIF2.12) and AD2 activity (TIF2.2). While TIF2.2
is more active in Cos-1 than in HeLa cells, all of the minimal fragments containing
AD1 are more active in HeLa cells. Along the same lines, removal in TIF2.3 of
sequences N-terminal of TIF2.8 resulted in a 10-fold and 2-fold increased
transactivation in HeLa and Cos-1 cells, respectively, and removal in TIF2.1 of
sequences C-terminal of TIF2.3 gave a 9-fold and 3-fold higher transactivation in
HeLa and Cos-1 cells, respectively. This suggests that the deleted sequences could
exert either intra- or intermodular repression on TIF2 AD1 activity/CBP
interaction. In addition, the coactivator activity of TIF2 may be cell-specifically
modulated by the differential efficiency of its two ADs and by factors interacting
with the sequences N- and C-terminal of the AD1 activation function.

It is worth noting that the core of AD1 (TIF2.32) on its own is a very poor
transactivator and CBP binder, and requires additional surrounding sequences to
generate a fully active (i.e., efficient CBP binding) surface. However, mutational
analysis of the AD1 core in the context of a strong activator fragment (TIF2.13)
reveals the critical importance for transactivation and CBP binding in vivo and in
vitro of three leucine residues (FIG. 10). These leucines belong to a
LLxxLxxxL (SEQ ID NO: 14) motif that is conserved in all three members of the
TIF2 coactivator family (FIG. 9a; Torchia, J. et al., Nature 387:677-684 (1997))
and is distinct from the LxxLL (SEQ ID NO: 12) NR box motif. Notably, although these
leucines are embedded in a predicted amphipathic α-helix, the amphiphilicity is not
required for function, since a mutation of the hydrophilic residues (compare
TIF2.13 with TIF2.13(DQ)) does not affect transactivation/CBP binding.

TIF2 can apparently fulfill at least two mediator functions, (i) as a
"bridging factor" between the AF-2 function of nuclear receptors and CBP via its
AD1 activation domain and (ii) as a transcriptional mediator through as yet
unknown CBP binding-independent route(s) via its AD2 function. Presently, we
have no evidence that TIF2 could possess an intrinsic enzymatic activity; none of
the bacterially-expressed purified TIF2 fragments, in particular TIF2.1 which was
shown to be fully competent in NR and CBP interactions in vitro, exhibited any
histone acetylase activity under conditions where bacterially-expressed purified
yeast GCN5 was highly active (our unpublished results).

Example 6: TIF2 enhancement of NR AF-2 activity

The observation that animal transcriptional activators, such as the human
ER (Metzger, D. et al., Nature 334:31-36 (1988)), are also active in the yeast
Saccharomyces cerevisiae demonstrated that the basic principles of transcriptional
enhancement have been conserved from yeast to man. We therefore investigated
whether TIF2 could enhance transcriptional activation by various NR constructs
expressed in S. cerevisiae. Both NRs and TIF2.1 were expressed from multicopy
plasmids in the yeast strain PL3(α), which contains a URA3 reporter gene under
the control of three estrogen response elements ((ERE)3-URA3; Pierrat, B. et al.,
Gene 119:237-245 (1992)).

In yeast, the hERα constructs were expressed from the following
YEp90-based plasmids: HEG0 (hERα, YEp90-HEG0, amino acids 1-595), HE15
(YEp90-HE15, amino acids 1-282), and HEG19 (YEp90-HEG19, amino acids
179-595) (all Pierrat, B. et al., Gene 119:237-245 (1992)), HE179-338
(YEp90-HE179-338; Pierrat, B. et al., Gene 143:193-200 (1994)). From the yeast
multicopy plasmid pBL1 (LeDouarin, B. et al., Nucleic Acids Res. 23:876-878
(1995b)), which codes for ER(F)-epitope-tagged ER(C)-fusions, the following
constructs were expressed: ER(C)-RAR(DEF) (pBL1-hRARα(DEF), amino acids
154-462) and ER(C)-RXR(DE) (pBL1-mRXRα(DE), amino acids 205-467) (both
vom Baur, E. et al., EMBO J. 15:110-124 (1996)). TIF2 was expressed in yeast
from the multicopy plasmid pAS3 (gift from B. LeDouarin), which is a derivative
of YEp90 containing the LEU2 marker. Yeast PL3(α) (Pierrat, B. et al., Gene
119:237-245 (1992)) transformants were grown exponentially in the presence or
absence of ligand for about five generations in selective medium containing uracil.
Yeast extracts were prepared and assayed for OMP decase activity as described
previously (Pierrat, B. et al., Gene 119:237-245 (1992)).

As expected from previous studies (Metzger, Nucl. Acids Res. (1992);
Pierrat, B. et al., Gene 119:237-245 (1992); Pierrat, B. et al., Gene 143:193-200
(1994)) the full length ER (HEG0) induced orotidine 5'-monophosphate
decarboxylase (OMP decase) activity in a ligand-dependent manner (FIG. 11,
lanes 1 and 3). Interestingly, the transcriptional activity of ER was further
enhanced by coexpression of the TIF2.1 fragment (FIG. 11, compare lanes 3 and
4). In the absence of hormone, TIF2.1 had no significant effect on ER-induced
transcriptional activation (FIG. 11, compare lanes 1 and 2). Essentially the same
results were observed for HEG19, which is devoid of the N-terminal region A/B,
indicating that TIF2 exerts its effect on the ligand-dependent ER AF-2 (FIG. 11,
lanes 5-8). In contrast, neither the AF-1 activity of HE15 (which encompasses
the ER regions A, B and C; Kumar & Chambon, Cell 55:145-156 (1988)), nor the
AF-2a activity of the HE179-338 construct (Pierrat, B. et al., Gene 143:193-200
(1994)) was stimulated by coexpressing TIF2.1 (FIG. 11, lanes 9-12). This is in
agreement with the results obtained in mammalian cells, and with the observation
that an intact LBD is required for TIF2.1 to interact with the ER (Voegel, J.J. et
al., EMBO J. 15:3667-3675 (1996)).

TIF2.1 also stimulated the AF-2 activity of the liganded RXRα(DEF)
region in yeast (FIG. 11, compare lanes 19 and 20). This enhancement was
ligand-dependent; no activation via the RXRα(DEF) region was observed when
the ER(C)-RXRα(DEF) chimera was coexpressed with TIF2.1 in the absence of
ligand (FIG. 11, compare lanes 18 and 20). Again these observations parallel
those made in HeLa and Cos-1 cells (see FIG. 12c).

Surprisingly, even in the absence of ligand, and in contrast with the
observations made with ER and RXRα, TIF2.1 very efficiently enhanced
transactivation by the RARα LBD (FIG. 11, lanes 13 and 14). The addition of
retinoic acid further increased this transcriptional activation (FIG. 11, lanes 14
and 16). Note that, as previously reported (Heery, D.M. et al., Nature
387:733-736 (1997)), both RARα and RXRα AF-2 on their own poorly activated
transcription from the URA3 reporter.

The enhancement of AF-2 activity, which was particularly strong in yeast
cells, has also been recently observed for GRIP1, the mouse homologue of TIF2
(Hong, H. et al., Mol. Cell. Biol. 17:2735-2744 (1997)). These observations
suggest that yeast cells contain coactivators which only poorly mimic the action
of the mammalian NR coactivators. As yeast cells apparently do not contain a
CBP homologue, it will be interesting to investigate which yeast factor mediates
the activity of TIF2. Note in this respect that GAL-TIF2.1 is a strong
transactivator in yeast (our unpublished results).

Interestingly, the expression of TIF2 in yeast led to a marked stimulation
of transactivation by the unliganded ER(C)-RARα(DEF), which was not observed
with ER or RXRα unliganded LBDs. Structural studies have revealed that
binding of the ligand results in a conformational change of the LBD, which
generates the surface(s) for coactivator binding (Renaud, J.P. et al., Nature
378:681-689 (1995)). Our present result, therefore, suggests that a high level of
coactivators might, in the absence of ligand, drive the LBD of some receptors into
a holo-LBD-like conformation, thus giving rise to ligand-independent
transcriptional activity. By analogy, one could speculate that high levels of
corepressors could "lock" NR LBDs in the apo-LBD conformation. It would
therefore be interesting to investigate whether altered levels of coregulators might
lead to constitutive activity (even in the presence of antagonists) or conversely to
lack of inducibility of nuclear receptors in some pathological states.

Example 7: TIF2 NID inhibition of NR AF-2 activity 

Overexpression of the TIF2.1 fragment, which contains both the NID and
AD1 activation function, stimulates the AF-2 activity of the ER LBD
transiently in Cos-1 cells (FIG. 12a, lanes 2 and 3; Voegel, J.J. et al., EMBO J.
15:3667-3675 (1996)). That this stimulation was due to a direct interaction
between the ER LBD and the NID of TIF2 was strongly suggested by the
observation that overexpression of the TIF2.5 mutant, which contains the isolated
NID but lacks AD1 (see FIG. 7a), prevented the stimulatory effect of TIF2.1
(FIG. 12a, compare lanes 3 and 4). Note that in the absence of TIF2.1, TIF2.5
overexpression also decreased the transactivation by the ER AF-2, which was
presumably mediated through Cos-1 endogenous coactivators (FIG. 12a, compare
lane 4 with lane 2), thus suggesting that these Cos-1 mediators either correspond
to endogenous TIF2s, or interact with the ER holo-LBD through surfaces which
are identical to, or in direct vicinity of, the TIF2 NID interaction surface.

We previously reported an agonist-dependent interaction of TIF2 with
RAR and RXR LBDs, which was dependent on the integrity of the NR AF-2 AD
core, but failed to observe a stimulatory effect of TIF2 on the transcriptional
activation of a (17m)5-globin-promoter-CAT reporter by GAL-RAR LBD or
GAL-RXR LBD fusion proteins (Voegel, J.J. et al., EMBO J. 15:3667-3675
(1996)). Since this failure was likely to be due to the presence of sufficient
amounts of endogenous mediators for achieving maximal transactivation from this
reporter gene, we modified the transfection conditions and used a reporter
construct bearing a minimal promoter. A clear stimulatory activity of TIF2 and
TIF2.1 on RXRα AF-2 was observed in HeLa and Cos-1 cells when using the
(17m)5-TATA-CAT reporter (FIG. 12c, compare lanes 3-6 and 11-14). This
stimulatory effect was less marked with RARα AF-2 and could be observed
reproducibly only with the TIF2.1 fragment in Cos-1 cells (FIG. 12d, compare
lane 10 with lanes 13 and 14; note that TIF2.1 is expressed at >10 fold higher
levels than TIF2; data not shown). The reporter plasmids (17m)5-TATA-CAT
(May, M. et al., EMBO J. 15:3093-3104 (1996)) and 17M5-G/CAT
((17m)5-β-globin-CAT; Durand, B. et al., EMBO J. 13:5370-5382 (1994)) each contain five
copies of the GAL4 response element in front of a simple TATA motif or of the
β-globin promoter, respectively, upstream from the CAT reporter gene.

Assuming that TIF2, or coactivators recognizing the TIF2 interacting
surface on NR LBDs, generally mediate the AF-2 function of NRs, the
NID-containing TIF2.5 should exert its dominant negative activity not only on ER, but
also on other NRs, independently of the cellular context. We therefore analyzed
the effect of TIF2.5 on the AF-2 activity of ER, RXRα and RARα in HeLa and
in Cos-1 cells (FIGS. 12b-d). In all cases, TIF2.5 expression led to a
dose-dependent inhibition of the NR AF-2 activity, indicating that the endogenously
present mediators were competed out by the isolated overexpressed TIF2 NID,
and strongly suggesting that TIF2 or transcriptional intermediary factors
recognizing the same or overlapping surfaces mediate NR AF-2 activity in these
transfected cells (FIGS. 12b-d, compare lane 2 with lanes 7 and 8; lane 10 with
lanes 15 and 16).
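
The dose-dependent inhibition described above is what one would expect from simple competition for a shared binding surface. As a purely illustrative sketch, the Python code below uses a textbook single-site competitive binding model in which an activating coactivator and an isolated NID compete for the same receptor surface. The model, the concentrations and the dissociation constants are all hypothetical and are not taken from the experiments reported here.

```python
def coactivator_occupancy(coactivator, competitor,
                          kd_coactivator=1.0, kd_competitor=1.0):
    """Fraction of receptor bound by the activating coactivator when a
    competitor binds the same site (single-site competitive binding)."""
    c = coactivator / kd_coactivator
    i = competitor / kd_competitor
    return c / (1.0 + c + i)

# Hypothetical, unitless doses of the isolated NID competitor.
competitor_doses = [0.0, 1.0, 5.0, 25.0]
occupancy = [coactivator_occupancy(1.0, dose) for dose in competitor_doses]

for dose, fraction in zip(competitor_doses, occupancy):
    print(f"competitor dose {dose:5.1f} -> coactivator-bound fraction {fraction:.3f}")
```

In this model the coactivator-bound fraction falls monotonically as the competitor dose rises, which is the qualitative behavior expected of a dominant negative NID; quantitative conclusions would require measured affinities.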

It is important to stress that our present data demonstrate that TIF2
interacts with NRs through a surface (NID) that is critical for NR AF-2 activity,
as expression of TIF2.5, which encompasses the NID, blocked the ligand-induced
activity of all tested NR AF-2s. This observation clearly establishes that,
at least in transfected cells, TIF2 or other coactivators which interact with an
overlapping, if not identical, holo-LBD surface are essential to mediate the NR
AF-2 activation function. This is in keeping with the presence of three NR box
motifs in the TIF2 NID, and of at least one conserved LxxLL (SEQ ID NO: 12)
NR box motif in all coactivators described to date.

It will be apparent to those skilled in the art that various modifications and 
variations can be made in the methods of the present invention without departing 
from the spirit and scope of the invention. Thus, it is intended that the present 
invention cover all modifications and variations of this invention provided they 
come within the scope of the appended claims and their equivalents. 

The disclosures of all patents, patent applications, and publications referred 
to above are hereby entirely and expressly incorporated herein by reference. 
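
As a reading aid for the sequence listing of SEQ ID NO:1, the Python sketch below shows how the running nucleotide number printed at the end of each sequence line relates to the codon count, and translates the first two coding lines. The codon table is deliberately partial (only the codons occurring in those two lines), and the helper names are hypothetical; this is a consistency check under those assumptions, not part of the disclosed methods.

```python
CDS_START = 163  # first nucleotide of the CDS, from the FEATURE entry

# Partial standard-code table: only the codons used in the two lines below.
CODONS = {
    "ATG": "Met", "AGT": "Ser", "GGG": "Gly", "GGA": "Gly", "GAA": "Glu",
    "AAT": "Asn", "ACC": "Thr", "TCT": "Ser", "GAC": "Asp", "CCC": "Pro",
    "TCC": "Ser", "AGG": "Arg", "GCA": "Ala", "GAG": "Glu", "ACA": "Thr",
    "AGA": "Arg", "AAG": "Lys", "CGC": "Arg",
}

def translate(codon_line):
    """Translate a space-separated line of codons into three-letter residues."""
    return " ".join(CODONS[codon] for codon in codon_line.split())

def end_of_line_position(codons_so_far):
    """Nucleotide number of the last base after the given number of codons."""
    return CDS_START - 1 + 3 * codons_so_far

line1 = "ATG AGT GGG ATG"
line2 = "GGA GAA AAT ACC TCT GAC CCC TCC AGG GCA GAG ACA AGA AAG CGC AAG"

print(translate(line1), end_of_line_position(4))
print(translate(line2), end_of_line_position(20))
```

Each full line of the listing carries 16 codons, so the end-of-line number advances by 48 from one line to the next.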

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: INSTITUT NATIONAL DE LA SANTE ET DE LA RECHERCHE MEDICALE
101 rue de Tolbiac
75654 Paris Cedex 13
France

CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE
3 rue Michel Ange
75794 Paris Cedex 16
France

UNIVERSITE LOUIS PASTEUR
4 rue Blaise Pascal, BP 1032F
67070 Strasbourg Cedex
France

BRISTOL-MYERS SQUIBB COMPANY
Route 206 & Provinceline Road
Princeton, New Jersey 08543-4000
United States of America

INVENTORS: Chambon, Pierre
Gronemeyer, Hinrich
Voegel, Johannes
Lutz, Yves

(ii) TITLE OF INVENTION: Transcriptional Intermediary Factor-2 
(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox P.L.L.C.

(B) STREET: 1100 New York Avenue, NW, Suite 600

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-3934 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE : Herewith 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/021,247 

(B) FILING DATE: 12-JUL-1996 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Steffe, Eric K.

(B) REGISTRATION NUMBER: 36,688

(C) REFERENCE/DOCKET NUMBER: 1383.013PC01/EKS

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 163..4554

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GGCGGCCGCA GCCTCGGCTA CAGCTTCGGC GGCGAAGGTC AGCGCCGACG GCAGCCGGCA 60 

CCTGACGGCG TGACCGACCC GAGCCGATTT CTCTTGGATT TGGCTACACA CTTATAGATC 120 

TTCTGCACTG TTTACAGGCA CAGTTGCTGA TATGTGTTCA AG ATG AGT GGG ATG 174 

Met Ser Gly Met 
1 

GGA GAA AAT ACC TCT GAC CCC TCC AGG GCA GAG ACA AGA AAG CGC AAG 222 
Gly Glu Asn Thr Ser Asp Pro Ser Arg Ala Glu Thr Arg Lys Arg Lys 
5 10 15 20 

GAA TGT CCT GAC CAA CTT GGA CCC AGC CCC AAA AGG AAC ACT GAA AAA 270 
Glu Cys Pro Asp Gln Leu Gly Pro Ser Pro Lys Arg Asn Thr Glu Lys
25 30 35

CGT AAT CGT GAA CAG GAA AAT AAA TAT ATA GAA GAA CTT GCA GAG TTG 318
Arg Asn Arg Glu Gln Glu Asn Lys Tyr Ile Glu Glu Leu Ala Glu Leu
40 45 50

ATT TTT GCA AAT TTT AAT GAT ATA GAC AAC TTT AAC TTC AAA CCT GAC 366
Ile Phe Ala Asn Phe Asn Asp Ile Asp Asn Phe Asn Phe Lys Pro Asp
55 60 65

AAA TGT GCA ATC TTA AAA GAA ACT GTG AAG CAA ATT CGT CAG ATC AAA 414
Lys Cys Ala Ile Leu Lys Glu Thr Val Lys Gln Ile Arg Gln Ile Lys
70 75 80 

GAA CAA GAG AAA GCA GCA GCT GCC AAC ATA GAT GAA GTG CAG AAG TCA 462 
Glu Gln Glu Lys Ala Ala Ala Ala Asn Ile Asp Glu Val Gln Lys Ser
85 90 95 100

GAT GTA TCC TCT ACA GGG CAG GGT GTC ATC GAG AAG GAT GCG CTG GGG 510
Asp Val Ser Ser Thr Gly Gln Gly Val Ile Asp Lys Asp Ala Leu Gly
105 110 115

CCT ATG ATG CTT GAG GCC CTT GAT GGG TTC TTC TTT GTA GTG AAC CTG 558
Pro Met Met Leu Glu Ala Leu Asp Gly Phe Phe Phe Val Val Asn Leu
120 125 130

GAA GGC AAC GTT GTG TTT GTG TCA GAG AAT GTG ACA CAG TAT CTA AGG 606
Glu Gly Asn Val Val Phe Val Ser Glu Asn Val Thr Gln Tyr Leu Arg
135 140 145

TAT AAC CAA GAA GAG CTG ATG AAC AAA ACT GTA TAT AGC ATC TTG CAT 654
Tyr Asn Gln Glu Glu Leu Met Asn Lys Ser Val Tyr Ser Ile Leu His
150 155 160

GTT GGG GAC CAC ACG GAA TTT GTC AAA AAC CTG CTG CCA AAG TCT ATA 702
Val Gly Asp His Thr Glu Phe Val Lys Asn Leu Leu Pro Lys Ser Ile
165 170 175 180

GTA AAT GGG GGA TCT TGG TCT GGC GAA CCT CCG AGG CGG AAC AGC CAT 750
Val Asn Gly Gly Ser Trp Ser Gly Glu Pro Pro Arg Arg Asn Ser His
185 190 195

ACC TTC AAT TGT CGG ATG CTG GTA AAA CCT TTA CCT GAT TCA GAA GAG 798
Thr Phe Asn Cys Arg Met Leu Val Lys Pro Leu Pro Asp Ser Glu Glu
200 205 210

CI? rf ^ T T ^ ^ GCT ^ AAA TAT GAA ACT ATG CAG 846
Glu Gly His Asp Asn Gln Glu Ala His Gln Lys Tyr Glu Thr Met Gln
215 220 225

TGC TTC GCT GTC TCT CAA CCA AAG TCC ATC AAA GAA GAA GGA GAA GAT 894
Cys Phe Ala Val Ser Gln Pro Lys Ser Ile Lys Glu Glu Gly Glu Asp
230 235 240

TTG CAG TCC TGC TTG ATT TGC GTG GCA AGA AGA GTT CCC ATG AAG GAA 942
Leu Gln Ser Cys Leu Ile Cys Val Ala Arg Arg Val Pro Met Lys Glu
245 250 255 260

AGA CCA GTT CTT CCC TCA TCA GAA ACT TTT ACT ACT CGC CAG GAT CTC 990
Arg Pro Val Leu Pro Ser Ser Glu Ser Phe Thr Thr Arg Gln Asp Leu
265 270 275

CAA GGC AAG ATC ACG TCT CTG GAT ACC AGC ACC ATG AGA GCA GCC ATG 1038
Gln Gly Lys Ile Thr Ser Leu Asp Thr Ser Thr Met Arg Ala Ala Met
280 285 290

AAA CCA GGC TGG GAG GAC CTG GTA AGA AGG TGT ATT CAG AAG TTC CAT 1086
Lys Pro Gly Trp Glu Asp Leu Val Arg Arg Cys Ile Gln Lys Phe His
295 300 305

GCG CAG CAT GAA GGA GAA TCT GTG TCC TAT GCT AAG AGG CAT CAT CAT 1134
Ala Gln His Glu Gly Glu Ser Val Ser Tyr Ala Lys Arg His His His
310 315 320

GAA GTA CTG AGA CAA GGA TTG GCA TTC AGT CAA ATC TAT CGT TTT TCC 1182 
Glu Val Leu Arg Gln Gly Leu Ala Phe Ser Gln Ile Tyr Arg Phe Ser 
325 330 335 340 

TTG TCT GAT GGC ACT CTT GTT GCT GCA CAA ACG AAG AGC AAA CTC ATC 1230 
Leu Ser Asp Gly Thr Leu Val Ala Ala Gln Thr Lys Ser Lys Leu Ile 
345 350 355 

CGT TCT CAG ACT ACT AAT GAA CCT CAA CTT GTA ATA TCT TTA CAT ATG 1278 
Arg Ser Gln Thr Thr Asn Glu Pro Gln Leu Val Ile Ser Leu His Met 
360 365 370 

CTT CAC AGA GAG CAG AAT GTG TGT GTG ATG AAT CCG GAT CTG ACT GGA 1326 
Leu His Arg Glu Gln Asn Val Cys Val Met Asn Pro Asp Leu Thr Gly 
375 380 385 

CAA ACG ATG GGG AAG CCA CTG AAT CCA ATT AGC TCT AAC AGC CCT GCC 1374 
Gln Thr Met Gly Lys Pro Leu Asn Pro Ile Ser Ser Asn Ser Pro Ala 
390 395 400 

CAT CAG GCC CTG TGC AGT GGG AAC CCA GGT CAG GAC ATG ACC CTC AGT 1422 
His Gln Ala Leu Cys Ser Gly Asn Pro Gly Gln Asp Met Thr Leu Ser 
405 410 415 420 

AGC AAT ATA AAT TTT CCC ATA AAT GGC CCA AAG GAA CAA ATG GGC ATG 1470 
Ser Asn Ile Asn Phe Pro Ile Asn Gly Pro Lys Glu Gln Met Gly Met 
425 430 435 

CCC ATG GGC AGG TTT GGT GGT TCT GGG GGA ATG AAC CAT GTG TCA GGC 1518 
Pro Met Gly Arg Phe Gly Gly Ser Gly Gly Met Asn His Val Ser Gly 
440 445 450 

ATG CAA GCA ACC ACT CCT CAG GGT AGT AAC TAT GCA CTC AAA ATG AAC 1566 
Met Gln Ala Thr Thr Pro Gln Gly Ser Asn Tyr Ala Leu Lys Met Asn 
455 460 465 

AGC CCC TCA CAA AGC AGC CCT GGC ATG AAT CCA GGA CAG CCC ACC TCC 1614 
Ser Pro Ser Gln Ser Ser Pro Gly Met Asn Pro Gly Gln Pro Thr Ser 
470 475 480 

ATG CTT TCA CCA AGG CAT CGC ATG AGC CCT GGA GTG GCT GGC AGC CCT 1662 
Met Leu Ser Pro Arg His Arg Met Ser Pro Gly Val Ala Gly Ser Pro 
485 490 495 500 

CGA ATC CCA CCC AGT CAG TTT TCC CCT GCA GGA AGC TTG CAT TCC CCT 1710 
Arg Ile Pro Pro Ser Gln Phe Ser Pro Ala Gly Ser Leu His Ser Pro 
505 510 515 

GTG GGA GTT TGC AGC AGC ACA GGA AAT AGC CAT AGT TAT ACC AAC AGC 1758 
Val Gly Val Cys Ser Ser Thr Gly Asn Ser His Ser Tyr Thr Asn Ser 
520 525 530 

TCC CTC AAT GCA CTT CAG GCC CTC AGC GAG GGG CAC GGG GTC TCA TTA 1806 
Ser Leu Asn Ala Leu Gln Ala Leu Ser Glu Gly His Gly Val Ser Leu 
535 540 545 
GGG TCA TCG TTG GCT TCA CCA GAC CTA AAA ATG GGC AAT TTG CAA AAC 1854 
Gly Ser Ser Leu Ala Ser Pro Asp Leu Lys Met Gly Asn Leu Gln Asn 
550 555 560 

TCC CCA GTT AAT ATG AAT CCT CCC CCA CTC AGC AAG ATG GGA AGC TTG 1902 
Ser Pro Val Asn Met Asn Pro Pro Pro Leu Ser Lys Met Gly Ser Leu 
565 570 575 580 

GAC TCA AAA GAC TGT TTT GGA CTA TAT GGG GAG CCC TCT GAA GGT ACA 1950 
Asp Ser Lys Asp Cys Phe Gly Leu Tyr Gly Glu Pro Ser Glu Gly Thr 
585 590 595 

ACT GGA CAA GCA GAG AGC AGC TGC CAT CCT GGA GAG CAA AAG GAA ACA 1998 
Thr Gly Gln Ala Glu Ser Ser Cys His Pro Gly Glu Gln Lys Glu Thr 
600 605 610 

AAT GAC CCC AAC CTG CCC CCG GCC GTG AGC AGT GAG AGA GCT GAC GGG 2046 
Asn Asp Pro Asn Leu Pro Pro Ala Val Ser Ser Glu Arg Ala Asp Gly 
615 620 625 

CAG AGC AGA CTG CAT GAC AGC AAA GGG CAG ACC AAA CTC CTG CAG CTG 2094 
Gln Ser Arg Leu His Asp Ser Lys Gly Gln Thr Lys Leu Leu Gln Leu 
630 635 640 

CTG ACC ACC AAA TCT GAT CAG ATG GAG CCC TCG CCC TTA GCC AGC TCT 2142 
Leu Thr Thr Lys Ser Asp Gln Met Glu Pro Ser Pro Leu Ala Ser Ser 
645 650 655 660 

TTG TCG GAT ACA AAC AAA GAC TCC ACA GGT AGC TTG CCT GGT TCT GGG 2190 
Leu Ser Asp Thr Asn Lys Asp Ser Thr Gly Ser Leu Pro Gly Ser Gly 
665 670 675 

TCT ACA CAT GGA ACC TCG CTC AAG GAG AAG CAT AAA ATT TTG CAC AGA 2238 
Ser Thr His Gly Thr Ser Leu Lys Glu Lys His Lys Ile Leu His Arg 
680 685 690 

CTC TTG CAG GAC AGC AGT TCC CCT GTG GAC TTG GCC AAG TTA ACA GCA 2286 
Leu Leu Gln Asp Ser Ser Ser Pro Val Asp Leu Ala Lys Leu Thr Ala 
695 700 705 

GAA GCC ACA GGC AAA GAC CTG AGC CAG GAG TCC AGC AGC ACA GCT CCT 2334 
Glu Ala Thr Gly Lys Asp Leu Ser Gln Glu Ser Ser Ser Thr Ala Pro 
710 715 720 

GGA TCA GAA GTG ACT ATT AAA CAA GAG CCG GTG AGC CCC AAG AAG AAA 2382 
Gly Ser Glu Val Thr Ile Lys Gln Glu Pro Val Ser Pro Lys Lys Lys 
725 730 735 740 

GAG AAT GCA CTA CTT CGC TAT TTG CTA GAT AAA GAT GAT ACT AAA GAT 2430 
Glu Asn Ala Leu Leu Arg Tyr Leu Leu Asp Lys Asp Asp Thr Lys Asp 
745 750 755 

ATT GGT TTA CCA GAA ATA ACC CCC AAA CTT GAG AGA CTG GAC AGT AAG 2478 
Ile Gly Leu Pro Glu Ile Thr Pro Lys Leu Glu Arg Leu Asp Ser Lys 
760 765 770 

ACA GAT CCT GCC AGT AAC ACA AAA TTA ATA GCA ATG AAA ACT GAG AAG 2526 
Thr Asp Pro Ala Ser Asn Thr Lys Leu Ile Ala Met Lys Thr Glu Lys 
775 780 785 

GAG GAG ATG AGC TTT GAG CCT GGT GAC CAG CCT GGC AGT GAG CTG GAC 2574 
Glu Glu Met Ser Phe Glu Pro Gly Asp Gln Pro Gly Ser Glu Leu Asp 
790 795 800 

AAC TTG GAG GAG ATT TTG GAT GAT TTG CAG AAT AGT CAA TTA CCA CAG 2622 
Asn Leu Glu Glu Ile Leu Asp Asp Leu Gln Asn Ser Gln Leu Pro Gln 
805 810 815 820 

CTT TTC CCA GAC ACG AGG CCA GGC GCC CCT GCT GGA TCA GTT GAC AAG 2670 
Leu Phe Pro Asp Thr Arg Pro Gly Ala Pro Ala Gly Ser Val Asp Lys 
825 830 835 

CAA GCC ATC ATC AAT GAC CTC ATG CAA CTC ACA GCT GAA AAC AGC CCT 2718 
Gln Ala Ile Ile Asn Asp Leu Met Gln Leu Thr Ala Glu Asn Ser Pro 
840 845 850 

GTC ACA CCT GTT GGA GCC CAG AAA ACA GCA CTG CGA ATT TCA CAG AGC 2766 
Val Thr Pro Val Gly Ala Gln Lys Thr Ala Leu Arg Ile Ser Gln Ser 
855 860 865 

ACT TTT AAT AAC CCA CGA CCA GGG CAA CTG GGC AGG TTA TTG CCA AAC 2814 
Thr Phe Asn Asn Pro Arg Pro Gly Gln Leu Gly Arg Leu Leu Pro Asn 
870 875 880 

CAG AAT TTA CCA CTT GAC ATC ACA TTG CAA AGC CCA ACT GGT GCT GGA 2862 
Gln Asn Leu Pro Leu Asp Ile Thr Leu Gln Ser Pro Thr Gly Ala Gly 
885 890 895 900 

CCT TTC CCA CCA ATC AGA AAC AGT AGT CCC TAC TCA GTG ATA CCT CAG 2910 
Pro Phe Pro Pro Ile Arg Asn Ser Ser Pro Tyr Ser Val Ile Pro Gln 
905 910 915 

CCA GGA ATG ATG GGT AAT CAA GGG ATG ATA GGA AAC CAA GGA AAT TTA 2958 
Pro Gly Met Met Gly Asn Gln Gly Met Ile Gly Asn Gln Gly Asn Leu 
920 925 930 

GGG AAC AGT AGC ACA GGA ATG ATT GGT AAC AGT GCT TCT CGG CCT ACT 3006 
Gly Asn Ser Ser Thr Gly Met Ile Gly Asn Ser Ala Ser Arg Pro Thr 
935 940 945 

ATG CCA TCT GGA GAA TGG GCA CCG CAG AGT TCG GCT GTG AGA GTC ACC 3054 
Met Pro Ser Gly Glu Trp Ala Pro Gln Ser Ser Ala Val Arg Val Thr 
950 955 960 

TGT GCT GCT ACC ACC AGT GCC ATG AAC CGG CCA GTC CAA GGA GGT ATG 3102 
Cys Ala Ala Thr Thr Ser Ala Met Asn Arg Pro Val Gln Gly Gly Met 
965 970 975 980 

ATT CGG AAC CCA GCA GCC AGC ATC CCC ATG AGG CCC AGC AGC CAG CCT 3150 
Ile Arg Asn Pro Ala Ala Ser Ile Pro Met Arg Pro Ser Ser Gln Pro 
985 990 995 



GGC CAA AGA CAG ACG CTT CAG TCT CAG GTC ATG AAT ATA GGG CCA TCT 3198 
Gly Gln Arg Gln Thr Leu Gln Ser Gln Val Met Asn Ile Gly Pro Ser 
1000 1005 1010 

GAA TTA GAG ATG AAC ATG GGG GGA CCT CAG TAT AGC CAA CAA CAA GCT 3246 
Glu Leu Glu Met Asn Met Gly Gly Pro Gln Tyr Ser Gln Gln Gln Ala 
1015 1020 1025 

CCT CCA AAT CAG ACT GCC CCA TGG CCT GAA AGC ATC CTG CCT ATA GAC 3294 
Pro Pro Asn Gln Thr Ala Pro Trp Pro Glu Ser Ile Leu Pro Ile Asp 
1030 1035 1040 

CAG GCG TCT TTT GCC AGC CAA AAC AGG CAG CCA TTT GGC ACT TCT CCA 3342 
Gln Ala Ser Phe Ala Ser Gln Asn Arg Gln Pro Phe Gly Ser Ser Pro 
1045 1050 1055 1060 

GAT GAC TTG CTA TGT CCA CAT CCT GCA GCT GAG TCT CCG ACT GAT GAG 3390 
Asp Asp Leu Leu Cys Pro His Pro Ala Ala Glu Ser Pro Ser Asp Glu 
1065 1070 1075 

GGA GCT CTC CTG GAC CAG CTG TAT CTG GCC TTG CGG AAT TTT GAT GGC 3438 
Gly Ala Leu Leu Asp Gln Leu Tyr Leu Ala Leu Arg Asn Phe Asp Gly 
1080 1085 1090 

CTG GAG GAG ATT GAT AGA GCC TTA GGA ATA CCC GAA CTG GTC AGC CAG 3486 
Leu Glu Glu Ile Asp Arg Ala Leu Gly Ile Pro Glu Leu Val Ser Gln 
1095 1100 1105 

[9 codons illegible] TCA AGT CAG GAT TCC AAC ATC 3534 
Ser Gln Ala Val Asp Pro Glu Gln Phe Ser Ser Gln Asp Ser Asn Ile 
1110 1115 1120 

ATG CTG GAG CAG AAG GCG CCC GTT TTC CCA CAG CAG TAT GCA TCT CAG 3582 
Met Leu Glu Gln Lys Ala Pro Val Phe Pro Gln Gln Tyr Ala Ser Gln 
1125 1130 1135 1140 

[6 codons illegible] AGC TAT TCT CCC ATG [1 codon illegible] GAT CCA AAC TTT 3630 
Ala Gln Met Ala Gln Gly Ser Tyr Ser Pro Met Gln Asp Pro Asn Phe 
1145 1150 1155 

CAC ACC ATG GGA CAG CGG CCT AGT TAT GCC ACA CTC CGT ATG CAG CCC 3678 
His Thr Met Gly Gln Arg Pro Ser Tyr Ala Thr Leu Arg Met Gln Pro 
1160 1165 1170 

AGA CCG GGC CTC AGG CCC ACG GGC CTA GTG CAG AAC CAG CCA AAT CAA 3726 
Arg Pro Gly Leu Arg Pro Thr Gly Leu Val Gln Asn Gln Pro Asn Gln 
1175 1180 1185 

[1 codon illegible] AGA [2 codons illegible] CTT [2 codons illegible] CGC CTC [1 codon illegible] CCA CAG CAG AAT CGC CAG 3774 
Leu Arg Leu Gln Leu Gln His Arg Leu Gln Ala Gln Gln Asn Arg Gln 
1190 1195 1200 

CCA CTT ATG AAT CAA ATC AGC AAT GTT TCC AAT GTG AAC TTG ACT CTG 3822 
Pro Leu Met Asn Gln Ile Ser Asn Val Ser Asn Val Asn Leu Thr Leu 
1205 1210 1215 1220 

AGG CCT GGA GTA CCA ACA CAG GCA CCT ATT AAT GCA CAG ATG CTG GCC 3870 
Arg Pro Gly Val Pro Thr Gln Ala Pro Ile Asn Ala Gln Met Leu Ala 
1225 1230 1235 
CAG AGA CAG AGG GAA ATC CTG AAC CAG CAT CTT CGA CAG AGA CAA ATG 3918 
Gln Arg Gln Arg Glu Ile Leu Asn Gln His Leu Arg Gln Arg Gln Met 
1240 1245 1250 

CAT CAG CAA CAG CAA GTT CAG CAA CGA ACT TTG ATG ATG AGA GGA CAA 3966 
His Gln Gln Gln Gln Val Gln Gln Arg Thr Leu Met Met Arg Gly Gln 
1255 1260 1265 

GGG TTG AAT ATG ACA CCA AGC ATG GTG GCT CCT AGT GGT ATG CCA GCA 4014 
Gly Leu Asn Met Thr Pro Ser Met Val Ala Pro Ser Gly Met Pro Ala 
1270 1275 1280 

ACT ATG AGC AAC CCT CGG ATT CCC CAG GCA AAT GCA CAG CAG TTT CCA 4062 
Thr Met Ser Asn Pro Arg Ile Pro Gln Ala Asn Ala Gln Gln Phe Pro 
1285 1290 1295 1300 

TTT CCT CCA AAC TAC GGA ATA AGT CAG CAA CCT GAT CCA GGC TTT ACT 4110 
Phe Pro Pro Asn Tyr Gly Ile Ser Gln Gln Pro Asp Pro Gly Phe Thr 
1305 1310 1315 

GGG GCT ACG ACT CCC CAG AGC CCA CTT ATG TCA CCC CGA ATG GCA CAT 4158 
Gly Ala Thr Thr Pro Gln Ser Pro Leu Met Ser Pro Arg Met Ala His 
1320 1325 1330 

ACA CAG AGT CCC ATG ATG CAA CAG TCT CAG GCC AAC CCA GCC TAT CAG 4206 
Thr Gln Ser Pro Met Met Gln Gln Ser Gln Ala Asn Pro Ala Tyr Gln 
1335 1340 1345 

GCC CCC TCC GAC ATA AAT GGA TGG GCG CAG GGG AAC ATG GGC GGA AAC 4254 
Ala Pro Ser Asp Ile Asn Gly Trp Ala Gln Gly Asn Met Gly Gly Asn 
1350 1355 1360 

AGC ATG TTT TCC CAG CAG TCC CCA CCA CAC TTT GGG CAG CAA GCA AAC 4302 
Ser Met Phe Ser Gln Gln Ser Pro Pro His Phe Gly Gln Gln Ala Asn 
1365 1370 1375 1380 

ACC AGC ATG TAC AGT AAC AAC ATG AAC ATC AAT GTG TCC ATG GCG ACC 4350 
Thr Ser Met Tyr Ser Asn Asn Met Asn Ile Asn Val Ser Met Ala Thr 
1385 1390 1395 

AAC ACA GGT GGC ATG AGC AGC ATG AAC CAG ATG ACA GGA CAG ATC AGC 4398 
Asn Thr Gly Gly Met Ser Ser Met Asn Gln Met Thr Gly Gln Ile Ser 
1400 1405 1410 

ATG ACC TCA GTG ACC TCC GTG TCT ACG TCA GGG CTG TCC TCC ATG GGT 4446 
Met Thr Ser Val Thr Ser Val Ser Thr Ser Gly Leu Ser Ser Met Gly 
1415 1420 1425 

CCC GAG CAG GTT AAT GAT CCT GCT CTG AGG GGA GGC AAC CTG TTC CCA 4494 
Pro Glu Gln Val Asn Asp Pro Ala Leu Arg Gly Gly Asn Leu Phe Pro 
1430 1435 1440 

AAC CAG CTG CCT GGA ATG GAT ATG ATT AAG CAG GAG GGA GAC ACA ACA 4542 
Asn Gln Leu Pro Gly Met Asp Met Ile Lys Gln Glu Gly Asp Thr Thr 
1445 1450 1455 1460 

CGG AAA TAT TGC TGACACTGCT GAAGCCAGTT GCTTCTTCAG CTGACCGGGC 4594 
Arg Lys Tyr Cys 

TCACTTGCTC AAAACACTTC CAGTCTGGAG AGCTGTGTCT ATTTGTTTCA ACCCAACTGA 4654 
CCTGCCAGCC GGTTCTGCTA GAGCAGACAG GCCTGGCCCT GGTTCCCAGG GTGGCGTCCA 4714 
CTCGGCTGTG GCAGGAGGAG CTGCCTCTTC TCTTGACAGT CTGAAGCTCG CATCCAGACA 4774 
GTCGCTCAGT CTGTTCCCTG CATTCACCTT AGTGCAACTT AGATCTCTCC TCCCCAAGTA 4834 
AATGTTGACA GGCCAATTTC ATACCCATGT CAGATTGAAT GTATTTAAAT GTATGTATTT 4894 
AAGGAGAACC ATGCTCTTGT TCTGTTCCTG TTCGGTTCCA GACACTGGTT TCTTGCTTTG 4954 
TTTTCCCTGG CTAACAGTCT AGTGCCAAAG ATTAAGATTT TATCTGGGGG AAAGAAAAGA 5014 
ATTTTTTAAA AAATTAAACT AAAGATGTTT TAAGCTAAAG CCTGAATTTG GGATGGAAGC 5074 
AGGACAGACA CCGTGGACAG CGCTGTATTT ACAGACACAC CCAGTGCGTG AAGACCAACA 5134 
AAGTCACAGT CGTATCTCTA GAAAGCTCTA AAGACCATGT TGGAAAGAGT CTCCAGTTAC 5194 
TGAACAGATG AAAAGGAGCC TGTGAGAGGG CTGTTAACAT TAGCAAATAT TTTTTCCTTG 5254 
TTTTTTCTTT GTTAAAACCA AACTGGTTCA CCTGAATCAT GAATTGAGAA GAAATAATTT 5314 
TCATTTCTAA ATTAAGTCCC TTTTAGTTTG ATCAGACAGC TTGAATCAGC ATCTCTTCTT 5374 
CCCTGTCAGC CTGACTCTTC CCTTCCCCTC TCTCATTCCC CATACTCCCT ATTTTCATTC 5434 
CTTTTTTAAA AAATAATATA AGCTACAGAA ACCAGGTAAG CCGTTTATTT CCTTAAATGT 5494 
TTTGCCAGCC ACTTACCAAT TGCTAAGTAT TGAATTTCAG AAAAAAAAAA TGCATTTACT 5554 
GGCAAGGAGA AGAGCAAAGT TAAGGCTTGA TACCAATCGA GCTAAGGATA CCTGCTTTGG 5614 
AAGCATGTTT ATTCTGTTCC CCAGCAACTC TGGCCTCCAA AATGGGAGAA ACGCCAGTGT 5674 
GTTTAAATTG ATAGCAGATA TCACGACAGA TTTAACCTCT GCCATGTGTT TTTTATTTTG 5734 
TTTTTTAGCA GTGCTGACTA AGCCGAAGTT TTGTAAGGTA CATAAAATCC AATTTATATG 5794 
TAAACAAGCA ATAATTTAAG TTGAGAACTT ATGTGTTTTA ATTGTATAAT TTTTGTGAGG 5854 
TATACATATT GTGGAATTGA CTCAAAAATG AGGTACTTCA GTATTAAATT AGATATCTTC 5914 
ATAGCAATGT CTCCTAAAGG TGTTTTGTAA AGGATATCAA TGCCTTGATT AGACCTAATT 5974 
TGTAGACTTA AGACTTTTTA TTTTCTAAAC CTTGTGATTC TGCTTATAAG TCATTTATCT 6034 
AATCTATATG ATATGCAGCC GCTGTAGGAA CCAATTCTTG ATTTTTATAT GTTTATATTC 6094 
TTTCTTAATG AACCTTAGAA AGACTACATG TTACTAAGCA GGCCACTTTT ATGGTTGTTT 6154 
TT 6156 
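
The paired nucleotide and amino acid rows of SEQ ID NO: 1 can be cross-checked against one another. The following is a minimal sketch, assuming the standard genetic code; the function names `translate_row`, `mismatches` and `row_end_position` are hypothetical and are not part of the specification. The offset of 162 is an assumption inferred from rows of the listing such as the row ending at residue 564, which is numbered 1854.

```python
# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter amino acid codes ("Ter" marks a stop codon).
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate_row(nucleotide_row):
    """Translate a row of space-separated codons into three-letter residues."""
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in nucleotide_row.split()]


def mismatches(nucleotide_row, amino_row):
    """List (column, codon, translated, listed) where the two rows disagree."""
    codons = nucleotide_row.split()
    listed = amino_row.split()
    translated = translate_row(nucleotide_row)
    return [
        (i + 1, codons[i], translated[i], listed[i])
        for i in range(min(len(codons), len(listed)))
        if translated[i] != listed[i]
    ]


def row_end_position(last_residue, offset=162):
    """Nucleotide position of the last base of the codon for last_residue.

    The offset is an assumption inferred from the numbered rows of the listing.
    """
    return 3 * last_residue + offset


# Row of SEQ ID NO: 1 ending at residue 292, as printed in the listing.
ROW_NT = "CAA GGC AAG ATC ACG TCT CTG GAT ACC AGC ACC ATG AGA GCA GCC ATG"
ROW_AA = "Gln Gly Lys Ile Thr Ser Leu Asp Thr Ser Thr Met Arg Ala Ala Met"

print(mismatches(ROW_NT, ROW_AA))
print(row_end_position(292))
```

A row whose codons and residues agree yields an empty mismatch list; a non-empty list points at the columns where the printed nucleotides and the printed residues should be compared by eye.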
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1464 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Gly Met Gly Glu Asn Thr Ser Asp Pro Ser Arg Ala Glu Thr 
1 5 10 15 

Arg Lys Arg Lys Glu Cys Pro Asp Gln Leu Gly Pro Ser Pro Lys Arg 
20 25 30 

Asn Thr Glu Lys Arg Asn Arg Glu Gln Glu Asn Lys Tyr Ile Glu Glu 
35 40 45 

Leu Ala Glu Leu Ile Phe Ala Asn Phe Asn Asp Ile Asp Asn Phe Asn 
50 55 60 

Phe Lys Pro Asp Lys Cys Ala Ile Leu Lys Glu Thr Val Lys Gln Ile 
65 70 75 80 

Arg Gln Ile Lys Glu Gln Glu Lys Ala Ala Ala Ala Asn Ile Asp Glu 
85 90 95 

Val Gln Lys Ser Asp Val Ser Ser Thr Gly Gln Gly Val Ile Asp Lys 
100 105 110 

Asp Ala Leu Gly Pro Met Met Leu Glu Ala Leu Asp Gly Phe Phe Phe 
115 120 125 

Val Val Asn Leu Glu Gly Asn Val Val Phe Val Ser Glu Asn Val Thr 
130 135 140 

Gln Tyr Leu Arg Tyr Asn Gln Glu Glu Leu Met Asn Lys Ser Val Tyr 
145 150 155 160 

Ser Ile Leu His Val Gly Asp His Thr Glu Phe Val Lys Asn Leu Leu 
165 170 175 

Pro Lys Ser Ile Val Asn Gly Gly Ser Trp Ser Gly Glu Pro Pro Arg 
180 185 190 

Arg Asn Ser His Thr Phe Asn Cys Arg Met Leu Val Lys Pro Leu Pro 
195 200 205 

Asp Ser Glu Glu Glu Gly His Asp Asn Gln Glu Ala His Gln Lys Tyr 
210 215 220 

Glu Thr Met Gln Cys Phe Ala Val Ser Gln Pro Lys Ser Ile Lys Glu 
225 230 235 240 

Glu Gly Glu Asp Leu Gln Ser Cys Leu Ile Cys Val Ala Arg Arg Val 
245 250 255 

Pro Met Lys Glu Arg Pro Val Leu Pro Ser Ser Glu Ser Phe Thr Thr 
260 265 270 

Arg Gln Asp Leu Gln Gly Lys Ile Thr Ser Leu Asp Thr Ser Thr Met 
275 280 285 

Arg Ala Ala Met Lys Pro Gly Trp Glu Asp Leu Val Arg Arg Cys Ile 
290 295 300 

Gln Lys Phe His Ala Gln His Glu Gly Glu Ser Val Ser Tyr Ala Lys 
305 310 315 320 

Arg His His His Glu Val Leu Arg Gln Gly Leu Ala Phe Ser Gln Ile 
325 330 335 

Tyr Arg Phe Ser Leu Ser Asp Gly Thr Leu Val Ala Ala Gln Thr Lys 
340 345 350 

Ser Lys Leu Ile Arg Ser Gln Thr Thr Asn Glu Pro Gln Leu Val Ile 
355 360 365 

Ser Leu His Met Leu His Arg Glu Gln Asn Val Cys Val Met Asn Pro 
370 375 380 

Asp Leu Thr Gly Gln Thr Met Gly Lys Pro Leu Asn Pro Ile Ser Ser 
385 390 395 400 

Asn Ser Pro Ala His Gln Ala Leu Cys Ser Gly Asn Pro Gly Gln Asp 
405 410 415 

Met Thr Leu Ser Ser Asn Ile Asn Phe Pro Ile Asn Gly Pro Lys Glu 
420 425 430 

Gln Met Gly Met Pro Met Gly Arg Phe Gly Gly Ser Gly Gly Met Asn 
435 440 445 

His Val Ser Gly Met Gln Ala Thr Thr Pro Gln Gly Ser Asn Tyr Ala 
450 455 460 

Leu Lys Met Asn Ser Pro Ser Gln Ser Ser Pro Gly Met Asn Pro Gly 
465 470 475 480 

Gln Pro Thr Ser Met Leu Ser Pro Arg His Arg Met Ser Pro Gly Val 
485 490 495 

Ala Gly Ser Pro Arg Ile Pro Pro Ser Gln Phe Ser Pro Ala Gly Ser 
500 505 510 

Leu His Ser Pro Val Gly Val Cys Ser Ser Thr Gly Asn Ser His Ser 
515 520 525 

Tyr Thr Asn Ser Ser Leu Asn Ala Leu Gln Ala Leu Ser Glu Gly His 
530 535 540 

Gly Val Ser Leu Gly Ser Ser Leu Ala Ser Pro Asp Leu Lys Met Gly 
545 550 555 560 

Asn Leu Gln Asn Ser Pro Val Asn Met Asn Pro Pro Pro Leu Ser Lys 
565 570 575 

Met Gly Ser Leu Asp Ser Lys Asp Cys Phe Gly Leu Tyr Gly Glu Pro 
580 585 590 

Ser Glu Gly Thr Thr Gly Gln Ala Glu Ser Ser Cys His Pro Gly Glu 
595 600 605 

Gln Lys Glu Thr Asn Asp Pro Asn Leu Pro Pro Ala Val Ser Ser Glu 
610 615 620 

Arg Ala Asp Gly Gln Ser Arg Leu His Asp Ser Lys Gly Gln Thr Lys 
625 630 635 640 

Leu Leu Gln Leu Leu Thr Thr Lys Ser Asp Gln Met Glu Pro Ser Pro 
645 650 655 

Leu Ala Ser Ser Leu Ser Asp Thr Asn Lys Asp Ser Thr Gly Ser Leu 
660 665 670 

Pro Gly Ser Gly Ser Thr His Gly Thr Ser Leu Lys Glu Lys His Lys 
675 680 685 

Ile Leu His Arg Leu Leu Gln Asp Ser Ser Ser Pro Val Asp Leu Ala 
690 695 700 

Lys Leu Thr Ala Glu Ala Thr Gly Lys Asp Leu Ser Gln Glu Ser Ser 
705 710 715 720 

Ser Thr Ala Pro Gly Ser Glu Val Thr Ile Lys Gln Glu Pro Val Ser 
725 730 735 

Pro Lys Lys Lys Glu Asn Ala Leu Leu Arg Tyr Leu Leu Asp Lys Asp 
740 745 750 

Asp Thr Lys Asp Ile Gly Leu Pro Glu Ile Thr Pro Lys Leu Glu Arg 
755 760 765 

Leu Asp Ser Lys Thr Asp Pro Ala Ser Asn Thr Lys Leu Ile Ala Met 
770 775 780 

Lys Thr Glu Lys Glu Glu Met Ser Phe Glu Pro Gly Asp Gln Pro Gly 
785 790 795 800 

Ser Glu Leu Asp Asn Leu Glu Glu Ile Leu Asp Asp Leu Gln Asn Ser 
805 810 815 

Gln Leu Pro Gln Leu Phe Pro Asp Thr Arg Pro Gly Ala Pro Ala Gly 
820 825 830 

Ser Val Asp Lys Gln Ala Ile Ile Asn Asp Leu Met Gln Leu Thr Ala 
835 840 845 

Glu Asn Ser Pro Val Thr Pro Val Gly Ala Gln Lys Thr Ala Leu Arg 
850 855 860 

Ile Ser Gln Ser Thr Phe Asn Asn Pro Arg Pro Gly Gln Leu Gly Arg 
865 870 875 880 

Leu Leu Pro Asn Gln Asn Leu Pro Leu Asp Ile Thr Leu Gln Ser Pro 
885 890 895 

Thr Gly Ala Gly Pro Phe Pro Pro Ile Arg Asn Ser Ser Pro Tyr Ser 
900 905 910 

Val Ile Pro Gln Pro Gly Met Met Gly Asn Gln Gly Met Ile Gly Asn 
915 920 925 

Gln Gly Asn Leu Gly Asn Ser Ser Thr Gly Met Ile Gly Asn Ser Ala 
930 935 940 

Ser Arg Pro Thr Met Pro Ser Gly Glu Trp Ala Pro Gln Ser Ser Ala 
945 950 955 960 

Val Arg Val Thr Cys Ala Ala Thr Thr Ser Ala Met Asn Arg Pro Val 
965 970 975 

Gln Gly Gly Met Ile Arg Asn Pro Ala Ala Ser Ile Pro Met Arg Pro 
980 985 990 

Ser Ser Gln Pro Gly Gln Arg Gln Thr Leu Gln Ser Gln Val Met Asn 
995 1000 1005 

Ile Gly Pro Ser Glu Leu Glu Met Asn Met Gly Gly Pro Gln Tyr Ser 
1010 1015 1020 

Gln Gln Gln Ala Pro Pro Asn Gln Thr Ala Pro Trp Pro Glu Ser Ile 
1025 1030 1035 1040 

Leu Pro Ile Asp Gln Ala Ser Phe Ala Ser Gln Asn Arg Gln Pro Phe 
1045 1050 1055 

Gly Ser Ser Pro Asp Asp Leu Leu Cys Pro His Pro Ala Ala Glu Ser 
1060 1065 1070 

Pro Ser Asp Glu Gly Ala Leu Leu Asp Gln Leu Tyr Leu Ala Leu Arg 
1075 1080 1085 

Asn Phe Asp Gly Leu Glu Glu Ile Asp Arg Ala Leu Gly Ile Pro Glu 
1090 1095 1100 

Leu Val Ser Gln Ser Gln Ala Val Asp Pro Glu Gln Phe Ser Ser Gln 
1105 1110 1115 1120 

Asp Ser Asn Ile Met Leu Glu Gln Lys Ala Pro Val Phe Pro Gln Gln 
1125 1130 1135 

Tyr Ala Ser Gln Ala Gln Met Ala Gln Gly Ser Tyr Ser Pro Met Gln 
1140 1145 1150 

Asp Pro Asn Phe His Thr Met Gly Gln Arg Pro Ser Tyr Ala Thr Leu 
1155 1160 1165 

WO 98/02455 PCT/US97/12100 

Arg Met Gln Pro Arg Pro Gly Leu Arg Pro Thr Gly Leu Val Gln Asn 
1170 1175 1180 

Gln Pro Asn Gln Leu Arg Leu Gln Leu Gln His Arg Leu Gln Ala Gln 
1185 1190 1195 1200 

Gln Asn Arg Gln Pro Leu Met Asn Gln Ile Ser Asn Val Ser Asn Val 
1205 1210 1215 

Asn Leu Thr Leu Arg Pro Gly Val Pro Thr Gln Ala Pro Ile Asn Ala 
1220 1225 1230 

Gln Met Leu Ala Gln Arg Gln Arg Glu Ile Leu Asn Gln His Leu Arg 
1235 1240 1245 

Gln Arg Gln Met His Gln Gln Gln Gln Val Gln Gln Arg Thr Leu Met 
1250 1255 1260 

Met Arg Gly Gln Gly Leu Asn Met Thr Pro Ser Met Val Ala Pro Ser 
1265 1270 1275 1280 

Gly Met Pro Ala Thr Met Ser Asn Pro Arg Ile Pro Gln Ala Asn Ala 
1285 1290 1295 

Gln Gln Phe Pro Phe Pro Pro Asn Tyr Gly Ile Ser Gln Gln Pro Asp 
1300 1305 1310 

Pro Gly Phe Thr Gly Ala Thr Thr Pro Gln Ser Pro Leu Met Ser Pro 
1315 1320 1325 

Arg Met Ala His Thr Gln Ser Pro Met Met Gln Gln Ser Gln Ala Asn 
1330 1335 1340 

Pro Ala Tyr Gln Ala Pro Ser Asp Ile Asn Gly Trp Ala Gln Gly Asn 
1345 1350 1355 1360 

Met Gly Gly Asn Ser Met Phe Ser Gln Gln Ser Pro Pro His Phe Gly 
1365 1370 1375 

Gln Gln Ala Asn Thr Ser Met Tyr Ser Asn Asn Met Asn Ile Asn Val 
1380 1385 1390 

Ser Met Ala Thr Asn Thr Gly Gly Met Ser Ser Met Asn Gln Met Thr 
1395 1400 1405 

Gly Gln Ile Ser Met Thr Ser Val Thr Ser Val Ser Thr Ser Gly Leu 
1410 1415 1420 

Ser Ser Met Gly Pro Glu Gln Val Asn Asp Pro Ala Leu Arg Gly Gly 
1425 1430 1435 1440 

Asn Leu Phe Pro Asn Gln Leu Pro Gly Met Asp Met Ile Lys Gln Glu 
1445 1450 1455 

Gly Asp Thr Thr Arg Lys Tyr Cys 
1460 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1036 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met Ser Ile Pro Arg Val Asn Pro Ser Val Asn Pro Ser Ile Ser Pro 
1 5 10 15 

Ala His Gly Val Ala Arg Ser Ser Thr Leu Pro Pro Ser Asn Ser Asn 
20 25 30 

Met Val Ser Thr Arg Ile Asn Arg Gln Gln Ser Ser Asp Leu His Ser 
35 40 45 

Ser Ser His Ser Asn Ser Ser Asn Ser Gln Gly Ser Phe Gly Cys Ser 
50 55 60 

Pro Gly Ser Gln Ile Val Ala Asn Val Ala Leu Asn Lys Gly Gln Ala 
65 70 75 80 

Ser Ser Gln Ser Ser Lys Pro Ser Leu Asn Leu Asn Asn Pro Pro Met 
85 90 95 

Glu Gly Thr Gly Ile Ser Leu Ala Gln Phe Met Ser Pro Arg Arg Gln 
100 105 110 

Val Thr Ser Gly Leu Ala Thr Arg Pro Arg Met Pro Asn Asn Ser Phe 
115 120 125 

Pro Pro Asn Ile Ser Thr Leu Ser Ser Pro Val Gly Met Thr Ser Ser 
130 135 140 

Ala Cys Asn Asn Asn Asn Arg Ser Tyr Ser Asn Ile Pro Val Thr Ser 
145 150 155 160 

Leu Gln Gly Met Asn Glu Gly Pro Asn Asn Ser Val Gly Phe Ser Ala 
165 170 175 

Ser Ser Pro Val Leu Arg Gln Met Ser Ser Gln Asn Ser Pro Ser Arg 
180 185 190 

Leu Asn Ile Gln Pro Ala Lys Ala Glu Ser Lys Asp Asn Lys Glu Ile 
195 200 205 

Ala Ser Thr Leu Asn Glu Met Ile Gln Ser Asp Asn Ser Ser Ser Asp 
210 215 220 

Gly Lys Pro Leu Asp Ser Gly Leu Leu His Asn Asn Asp Arg Leu Ser 
225 230 235 240 

Asp Gly Asp Ser Lys Tyr Ser Gln Thr Ser His Lys Leu Val Gln Leu 
245 250 255 

Leu Thr Thr Thr Ala Glu Gln Gln Leu Arg His Ala Asp Ile Asp Thr 
260 265 270 

Ser Cys Lys Asp Val Leu Ser Cys Thr Gly Thr Ser Asn Ser Ala Ser 
275 280 285 

Ala Asn Ser Ser Gly Gly Ser Cys Pro Ser Ser His Ser Ser Leu Thr 
290 295 300 

Ala Arg His Lys Ile Leu His Arg Leu Leu Gln Glu Gly Ser Pro Ser 
305 310 315 320 

Asp Ile Thr Thr Leu Ser Val Glu Pro Asp Lys Lys Asp Ser Ala Ser 
325 330 335 

Thr Ser Val Ser Val Thr Gly Gln Val Gln Gly Asn Ser Ser Ile Lys 
340 345 350 

Leu Glu Leu Asp Ala Ser Lys Lys Lys Glu Ser Lys Asp His Gln Leu 
355 360 365 

Leu Arg Tyr Leu Leu Asp Lys Asp Glu Lys Asp Leu Arg Ser Thr Pro 
370 375 380 

Asn Leu Ser Leu Asp Asp Val Lys Val Lys Val Glu Lys Lys Glu Gln 
385 390 395 400 

Met Asp Pro Cys Asn Thr Asn Pro Thr Pro Met Thr Lys Ala Thr Pro 
405 410 415 

Glu Glu Ile Lys Leu Glu Ala Gln Ser Gln Phe Thr Ala Asp Leu Asp 
420 425 430 

Gln Phe Asp Gln Leu Leu Pro Thr Leu Glu Lys Ala Ala Gln Leu Pro 
435 440 445 

Gly Leu Cys Glu Thr Asp Arg Met Asp Gly Ala Val Thr Ser Val Thr 
450 455 460 

Ile Lys Ser Glu Ile Thr Ile Lys Ser Glu Ile Leu Pro Ala Ser Leu 
465 470 475 480 

Gln Ser Ala Thr Ala Arg Pro Thr Ser Arg Leu Asn Arg Leu Pro Glu 
485 490 495 

Leu Glu Leu Glu Ala Ile Asp Asn Gln Phe Gly Gln Pro Gly Thr Gly 
500 505 510 

Asp Gln Ile Pro Trp Thr Asn Asn Thr Val Thr Ala Ile Asn Gln Ser 
515 520 525 



Lys Ser Glu Asp Gln Cys Ile Ser Ser Gln Leu Asp Glu [illegible] Leu 
530 535 540 

Pro Pro Thr Thr Val Glu Gly Arg Asn [illegible] Glu Lys Ala [illegible] 
545 550 555 560 

Gln Leu Val Ser Phe Leu Ser Gly Lys Asp Glu Thr Glu Leu Ala Glu 
565 570 575 

[illegible] Leu Gly Ile [illegible] Val Gln Gly Gly Gly Leu 
580 585 590 

Asp Val Leu Ser Glu Arg Phe Pro Pro Gln Gln Ala Thr Pro Pro Leu 
595 600 605 

[illegible] Met Glu Glu Arg Pro Asn Leu Tyr Ser Gln Pro Tyr Ser Ser Pro 
610 615 620 

Phe Pro Thr Ala Asn Leu Pro Ser Pro Phe Gln Gly Met Val Arg Gln 
625 630 635 640 

Lys Leu [illegible] Pro Val Gln Val Thr [illegible] Arg Gly 
645 650 655 

Ala Phe Ser Pro Gly Met Gly Met Gln Pro Arg Gln T*r Leu Asn Arg 
660 665 670 

Pro Pro Ala Ala Pro Asn Gln Leu Arg Leu Gln Leu Gln Gln Arg Leu 
675 680 685 

Gln Gly [illegible] Ile His [illegible] Asn [illegible] 
690 695 700 

Gln Phe Ala Ala Thr [illegible] Pro Val Gly [illegible] 
705 710 715 720 

Gln Gln Gln Ile Thr Pro Gln Pro Pro Leu Asn Ala Gln Met Leu Ala 
725 730 735 

Gln Arg Gln Arg Glu Leu Tyr Ser Gln Gln His Arg Gln Arg Gln Leu 
740 745 750 

Ile Gln Gln Gln Arg Ala Met Leu Met Arg Gln Gln Ser Phe Gly Asn 
755 760 765 

Asn Leu Pro Pro Ser Ser Gly Leu Pro Val Gln Thr Gly Asn Pro Arg 
770 775 780 

[illegible] Gln [illegible] Pro [illegible] Phe Pro [illegible] 
785 790 795 800 

Thr Asn Pro Gly Thr Pro Pro Ala Ser Thr Ser Pro Phe Ser Gln Leu 
805 810 815 

Ala Ala Asn Pro Glu Ala Ser Leu Ala Asn Arg Asn Ser Met Val Ser 
820 825 830 

Arg Gly Met Thr Gly Asn Ile Gly Gly Gln Phe Gly Thr Gly Ile Asn 
835 840 845 

Pro Gln Met Gln Gln Asn Val Phe Gln Tyr Pro Gly Ala Gly Met Val 
850 855 860 

Pro Gln Gly Glu Ala Asn Phe Ala Pro Ser Leu Ser Pro Gly Ser Ser 
865 870 875 880 

Met Val Pro Met Pro Ile Pro Pro Pro Gln Ser Ser Leu Leu Gln Gln 
885 890 895 

Thr Pro Pro Ala Ser Gly Tyr Gln Ser Pro Asp Met Lys Ala Trp Gln 
900 905 910 

Gln Gly Ala Ile Gly Asn Asn Asn Val Phe Ser Gln Ala Val Gln Asn 
915 920 925 

Gln Pro Thr Pro Ala Gln Pro Gly Val Tyr Asn Asn Met Ser Ile Thr 
930 935 940 

Val Ser Met Ala Gly Gly Asn Thr Asn Val Gln Asn Met Asn Pro Met 
945 950 955 960 

Met Ala Gln Met Gln Met Ser Ser Leu Gln Met Pro Gly Met Asn Thr 
965 970 975 

Val Cys Pro Glu Gln Ile Asn Asp Pro Ala Leu Arg His Thr Gly Leu 
980 985 990 

Tyr Cys Asn Gln Leu Ser Ser Thr Asp Leu Leu Lys Thr Glu Ala Asp 
995 1000 1005 

Gly Thr Gln Gln Val Gln Gln Val Gln Val Phe Ala Asp Val Gln Cys 
1010 1015 1020 

Thr Val Asn Leu Val Gly Gly Asp Pro Tyr Leu Asn 
1025 1030 1035 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Glu Arg Ala Asp Gly Gln Ser Arg Leu His Asp Ser Lys Gly Gln Thr 
1 5 10 15 

Lys Leu Leu Gln Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Gly His Lys Lys Leu Leu Gln Leu Leu Thr Cys Ser Ser His Gly Ser 
1 5 10 15 

Leu Leu Gln Glu Lys His Arg Ile Leu His Lys Leu Leu Gln Asn Gly 
20 25 30 

Asn Asn Ala Leu Leu Arg Tyr Leu Leu Asp Arg Asp Asp Pro Ser Asp 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Arg Ser Ile Leu Thr Ser Leu Leu Leu Asn Ser Ser 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
Phe Asn Val Leu Lys Gln Leu Leu Leu Ser Glu Asn 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
Ser Ala Thr Leu Arg Ser Leu Leu Leu Asn Pro His 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Leu Arg Asn Ser Leu Asp Asp Leu Leu Gly Pro Pro Ser Asn Ala Glu 
1 5 10 15 

Gly Gln Ser Asp Glu Arg Ala Leu Leu Asp Gln Leu His Thr Phe Leu 
20 25 30 

Ser Asn Thr Asp Ala Thr Gly Leu Glu Glu Ile Asp Arg Ala Leu Gly 
35 40 45 

Ile Pro Glu Leu Val Asn Gln Gly Gln Ala 
50 55 




(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 64 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AGACCTGTTG AACTTTGCAA AGGCAAGGGC AGTTCCTTTG AGCTGGGCTT ATGACCTTTG 60 
ACTC 64 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CGGAGGACAG TCCTCCGGCG GCCGCGGTCA CAGTGACC 38 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Leu Xaa Xaa Leu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Leu Xaa Xaa Leu Leu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu Leu Xaa Xaa Leu Xaa Xaa Xaa Leu 
1 5 
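
The consensus sequences of SEQ ID NOS: 12-14 use Xaa to denote any residue, so a stretch of the listing can be tested against them position by position. The following is a minimal sketch, assuming a simple sliding-window comparison; the names `matches` and `find_motif` are hypothetical and are not part of the specification. The fragment is residues 629-660 of SEQ ID NO: 2 as printed in the listing.

```python
XAA = "Xaa"  # wildcard residue, as used in SEQ ID NOS: 12-14


def matches(window, motif):
    """True if each motif position is Xaa or equals the residue at that position."""
    return len(window) == len(motif) and all(
        m == XAA or m == r for r, m in zip(window, motif)
    )


def find_motif(residues, motif, first_position=1):
    """Return the numbered start positions at which motif occurs in residues."""
    n = len(motif)
    return [
        first_position + i
        for i in range(len(residues) - n + 1)
        if matches(residues[i:i + n], motif)
    ]


# Residues 629-660 of SEQ ID NO: 2, copied from the listing.
FRAGMENT_629_660 = (
    "Gln Ser Arg Leu His Asp Ser Lys Gly Gln Thr Lys Leu Leu Gln Leu "
    "Leu Thr Thr Lys Ser Asp Gln Met Glu Pro Ser Pro Leu Ala Ser Ser"
).split()

SEQ_ID_NO_12 = "Leu Xaa Xaa Leu Leu".split()

print(find_motif(FRAGMENT_629_660, SEQ_ID_NO_12, first_position=629))
```

For this fragment the only match to SEQ ID NO: 12 starts at residue 641, that is, residues 641-645 (Leu Leu Gln Leu Leu).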
What Is Claimed Is: 



1. An isolated nucleic acid molecule comprising a polynucleotide 
selected from the group consisting of: 

(a) a polynucleotide encoding a polypeptide having an amino 
acid sequence as shown in Figure 1 (SEQ ID NO:2); 

(b) a polynucleotide encoding a polypeptide having an amino 
acid sequence as encoded by the cDNA contained in ATCC Deposit No. 97612; 

(c) a polynucleotide encoding a TIF2 polypeptide fragment 
comprising amino acids 624-869 as shown in Figure 1 (SEQ ID NO:2); 

(d) a polynucleotide encoding a TIF2 polypeptide fragment 
comprising amino acids 624-1131 as shown in Figure 1 (SEQ ID NO:2); 

(e) a polynucleotide encoding a TIF2 polypeptide fragment 
comprising amino acids 1010-1131 as shown in Figure 1 (SEQ ID NO:2); 

(f) a polynucleotide that encodes a TIF2 polypeptide fragment 
comprising amino acids 1288-1464 in Figure 1 (SEQ ID NO:2); 

(g) a polynucleotide having a nucleotide sequence at least 90% 
identical to the nucleotide sequence of any of the polynucleotides of (a)-(f); 

(h) a polynucleotide that hybridizes under stringent conditions 
to any of the polynucleotides of (a)-(g) or the complement thereof; 

(i) a polynucleotide fragment of any of the polynucleotides of 
(a)-(h), wherein said fragment is at least 15 bp in length; and 

(j) a polynucleotide having a nucleotide sequence 
complementary to the nucleotide sequence of any of the polynucleotides of 
(a)-(i). 



2. The isolated nucleic acid molecule of claim 1, wherein said 
polynucleotide is DNA. 
3. The isolated nucleic acid molecule of claim 1, wherein said 
polynucleotide is RNA. 

4. The isolated nucleic acid molecule of claim 1, wherein said 
polynucleotide encodes a cytoplasmic TIF2 polypeptide fragment. 

5. The isolated nucleic acid molecule of claim 4, wherein said TIF2 
polypeptide fragment is selected from the group consisting of TIF2.1, TIF2.2, 
TIF2.3, TIF2.4, TIF2.5, TIF2.6, TIF2.7, TIF2.8, TIF2.9, and TIF2.12. 

6. A method for making a recombinant vector comprising inserting 
the isolated nucleic acid molecule of claim 1 into a vector. 

7. A recombinant vector produced by the method of claim 6. 

8. A method of making a recombinant host cell comprising 
introducing the recombinant vector of claim 7 into a host cell. 

9. A recombinant host cell produced by the method of claim 8. 

10. A recombinant method for producing a polypeptide, comprising 
culturing the recombinant host cell of claim 9. 

11. An isolated polypeptide selected from the group consisting of: 

(a) a polypeptide having an amino acid sequence of the TIF2 
polypeptide comprising the complete amino acid sequence as shown in Figure 1 
(SEQ ID NO:2); 

(b) a polypeptide having an amino acid sequence comprising 
residues 624 to 869 as shown in Figure 1 (SEQ ID NO:2); 
(c) a polypeptide having an amino acid sequence comprising 
residues 624-1131 as shown in Figure 1 (SEQ ID NO:2); 

(d) a polypeptide having an amino acid sequence comprising 
residues 1010-1131 as shown in Figure 1 (SEQ ID NO:2); 

(e) a polypeptide having an amino acid sequence comprising 
residues 1288-1464 as shown in Figure 1 (SEQ ID NO:2); 

(f) a polypeptide which is at least 90% identical to any of the 
polypeptides of (a)-(e); 

(g) a polypeptide having an amino acid sequence comprising 
residues 641-645 as shown in Figure 1 (SEQ ID NO:2); 

(h) a polypeptide having an amino acid sequence comprising 
residues 690-694 as shown in Figure 1 (SEQ ID NO:2); and 

(i) a polypeptide having an amino acid sequence comprising 
residues 745-749 as shown in Figure 1 (SEQ ID NO:2). 

12. The isolated polypeptide of claim 11, wherein said polypeptide is 
a cytoplasmic TIF2 polypeptide fragment. 

13. The isolated polypeptide of claim 12, wherein said fragment is 
selected from the group consisting of TIF2.1, TIF2.2, TIF2.3, TIF2.4, TIF2.5, 
TIF2.6, TIF2.7, TIF2.8, TIF2.9, and TIF2.12. 

14. A screening method for identifying a nuclear receptor (NR) 
antagonist, which comprises: 

(a) providing a host cell containing recombinant genes which 
express a polypeptide comprising a NR ligand binding domain (LBD) and a 
polypeptide comprising transcriptional intermediary factor-2 (TIF-2) or a 
TIF-2-fragment, wherein, in the presence of an agonist, said TIF-2 and said 
TIF-2-fragment bind said NR LBD and are capable of activating transcription; 

(b) administering a candidate antagonist to said cell; and 

(c) determining whether said candidate antagonist reduces 
either: (1) TIF-2- or TIF-2-fragment-binding to the AF-2 of said NR LBD as 
compared to said binding in the absence of said candidate antagonist; or (2) 
TIF-2- or TIF-2-fragment-stimulated NR LBD AF-2-mediated transactivation as 
compared to said transactivation in the absence of said candidate antagonist. 

15. A screening method for identifying a nuclear receptor (NR) 
agonist, which comprises: 

(a) providing a host cell containing recombinant genes which 
express a polypeptide comprising a NR ligand binding domain (LBD) and a 
polypeptide comprising transcriptional intermediary factor-2 (TIF-2) or a 
TIF-2-fragment, wherein, in the presence of an agonist, said TIF-2 and said 
TIF-2-fragment bind said NR LBD; 

(b) administering a candidate agonist to said cell; and 

(c) determining whether said candidate agonist enhances either: 
(1) TIF-2- or TIF-2-fragment-binding to the AF-2 of said NR LBD as compared 
to said binding in the absence of said candidate agonist; or (2) TIF-2- or 
TIF-2-fragment-stimulated NR LBD AF-2-mediated transactivation as compared to said 
transactivation in the absence of said candidate agonist. 

16. The method of claim 14, wherein said host cell further expresses 
a reporter gene. 

17. The method of claim 15, wherein said host cell further expresses 
a reporter gene. 

18. The method of claim 14, wherein said host cell expresses a 
polypeptide comprising a NR LBD and GAL4. 

19. The method of claim 15, wherein said host cell expresses a 
polypeptide comprising a NR LBD and GAL4. 



20. An isolated antibody that binds specifically to an isolated 
polypeptide according to claim 11. 